Extensions of probability-preserving systems by
measurably-varying homogeneous spaces and applications



Tim Austin 



Abstract 

We study a generalized notion of a homogeneous skew-product exten- 
sion of a probability-preserving system in which the homogeneous space fi- 
bres are allowed to vary over the ergodic decomposition of the base. The 
construction of such extensions rests on a simple notion of 'direct integral' 
for a 'measurable family' of homogeneous spaces, which has a number of 
precedents in older literature. The main contribution of the present paper 
is the systematic development of a formalism for handling such extensions, 
including non-ergodic versions of the results of Mackey describing ergodic 
components of such extensions [34], of the Furstenberg-Zimmer Structure 
Theory [50, 49, 23] and of results of Mentzen [37] describing the structure of
automorphisms of such extensions when they are relatively ergodic. We then 
offer applications to two structural results for actions of several commuting 
transformations: firstly to describing the possible joint distributions of three 
isotropy factors corresponding to three commuting transformations; and
secondly to describing the characteristic factors for a system of double
nonconventional ergodic averages (see [4] and the references listed there). Although
both applications are modest in themselves, we hope that they point towards 
a broader usefulness of this formalism in ergodic theory. 



2000 Mathematics Subject Classification: Primary 28D15; Secondary 37A30, 37A45. 
Key words and phrases: Probability-preserving system, skew product, isometric extension,
ergodic decomposition, Mackey Theory, Furstenberg-Zimmer Theory.






Contents 



1 Introduction

2 Background and notation

2.1 Measurable functions and probability kernels

2.2 Systems, subactions and factors

2.3 Measurable selectors

2.4 Background from group theory

3 Direct integrals of homogeneous-space data

4 Mackey Theory in the non-ergodic setting

4.1 Ergodic decompositions and Mackey group data

4.2 More general lifted measures on homogeneous space extensions

5 Relative weak non-mixing and isometric extensions of non-ergodic systems

5.1 Generalized eigenfunctions and finite-rank modules

5.2 The non-ergodic Furstenberg-Zimmer inverse theorem

6 Factors and automorphisms of isometric extensions

6.1 Some more notation

6.2 The structure theorems

7 Applications

7.1 Application to joint distributions of isotropy factors

7.2 Reduction to compositions of isotropy factors

7.3 Some isometric extensions and their associated Mackey data

7.4 Application to characteristic factors

8 Further questions



1 Introduction 

This work is concerned with probability-preserving actions $T : \Gamma \curvearrowright (X, \mu)$ of
locally compact second countable topological groups on standard Borel probability
spaces. We often denote such an action by $(X, \mu, T)$ if the group is understood.

One of the more versatile constructions by which a more complicated system may
be built from comparatively simple ingredients is the homogeneous skew-product
(see, for example, Examples 2.21 in Glasner [29]). From some given $\Gamma$-system
$(Y, \nu, S)$, a compact group $G$ and a closed subgroup $K \le G$, and a measurable
cocycle $\rho : \Gamma \times Y \to G$ for the action $S$, we form the system
$(Y \times G/K, \nu \otimes m_{G/K}, T)$ by setting

$$T^\gamma(y, gK) := (S^\gamma y, \rho(\gamma, y) gK) \quad \text{for } \gamma \in \Gamma.$$
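
As a purely illustrative sketch of this construction (not taken from the paper), the following finite toy model takes $\Gamma = \mathbb{Z}$ acting on $Y = \mathbb{Z}/5$ by rotation, $G = \mathbb{Z}/4$ written additively and $K = \{0, 2\}$. The sizes, the names `S`, `sigma`, `rho`, `T` and the choice of cocycle generator are all assumptions made for the example. It checks the cocycle identity and that the skew-product map is a bijection of $Y \times G/K$, so that it preserves the uniform measure, which plays the role of $\nu \otimes m_{G/K}$ here.

```python
# Toy model of a homogeneous skew-product over a Z-action.
# Hypothetical choices: Y = Z/5 with rotation, G = Z/4 (additive), K = {0, 2}.
N, M = 5, 4
K = frozenset({0, 2})


def coset(g):
    """The coset g + K in G/K, represented as a frozenset."""
    return frozenset((g + k) % M for k in K)


def S(n, y):
    """The base action S^n on Y = Z/N (rotation by n)."""
    return (y + n) % N


def sigma(y):
    """rho(1, y): for a Z-action any function Y -> G generates a cocycle."""
    return (y * y) % M


def rho(n, y):
    """rho(n, y) for n >= 0, summing sigma along the first n points of the orbit of y."""
    total = 0
    for i in range(n):
        total = (total + sigma(S(i, y))) % M
    return total


def T(n, point):
    """Skew-product action T^n(y, g + K) = (S^n y, rho(n, y) + g + K)."""
    y, c = point
    g = min(c)  # any representative of the coset gives the same answer
    return (S(n, y), coset(rho(n, y) + g))


# The finite space Y x G/K.
space = {(y, coset(g)) for y in range(N) for g in range(M)}

# T^1 permutes the space, hence preserves the uniform measure on it.
is_bijection = {T(1, p) for p in space} == space
```

The cocycle identity $\rho(m + n, y) = \rho(m, S^n y) + \rho(n, y)$ is what makes `T` an action, that is, $T^{m+n} = T^m \circ T^n$.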

A well-developed theory of such systems is available in case the base system
$(Y, \nu, S)$ is ergodic (much of which can be extended to the setting in which $\nu$ is
only quasi-invariant under $S$; see IH). In addition to providing a wealth of example
systems, such homogeneous skew-products over ergodic base systems acquire
a greater significance through the structure theory developed by Zimmer in [50, 49]
and Furstenberg in [23]. This is concerned with the failure of relative weak mixing
of extensions (see, for example, Definition 9.22 in Glasner [29]). Relative
weak mixing is a strengthening of the condition of relative ergodicity which has
numerous consequences for how this extension may be joined to others. An
understanding of these consequences and of the ways in which relative weak mixing can
fail is crucial to Furstenberg's approach to Szemerédi's Theorem ([23]; see also
the excellent treatment in [24]). The core result of Furstenberg and Zimmer is an
inverse theorem according to which an extension of ergodic systems fails to be
relatively weakly mixing if and only if it contains a nontrivial subextension that can be
coordinatized as a homogeneous skew-product: thus, homogeneous skew-products
are identified as precisely the 'obstructions' to relative weak mixing within other
extensions.

However, in many applications in which this ergodicity of the base system fails, 
this simple homogeneous skew-product construction is not quite general enough, 
and the Furstenberg-Zimmer theory outlined above is not available without modi- 
fication. 

In this paper we shall extend the definition of homogeneous skew-product to a
more general class of systems by the simple artifice of allowing the fibre $G_y/K_y$,
in addition to the cocycle $\rho(y)$, to vary as a function of the base point $y$. This
leads to a definition of an extension of $(Y, \nu, S)$ given as an action of the group on
a 'direct integral' of homogeneous space fibres over $(Y, \nu)$. It is relatively simple
to settle on a definition of 'measurability' for such an assignment of fibres, and to
turn this idea into a rigorous definition.

The study of such measurably-varying groups is certainly not new. It is already
alluded to during the introduction to Section 6.1 of Guichardet's book [30] in the
context of 'measurable current groups' and their representation theory, motivated
in turn by considerations from algebraic quantum field theory. A number of more
recent works have studied constructions of this nature explicitly. For example,
in [16] Conze and Raugy analyze the ergodic decomposition of various $\sigma$-finite
measures associated to an extension of a non-singular ergodic base transformation
by a locally compact non-compact group, invoking for their description a
measurably-varying family of subgroups of that fibre group. However, in their
setting the need for a measurably-varying family of groups is related to the
possibly non-smooth structure of the Borel equivalence relation of conjugacy on the
collection of all such subgroups, an issue which disappears upon restriction to the
case of compact fibre groups, and so the results that they develop are still rather
disconnected from the present paper. Perhaps closest to our present emphasis is the
emergence of measurably-varying subgroups of a fixed compact group in the
analysis of measurably-varying Mackey groups for certain non-ergodic self-joinings of
an ergodic system, which underlies some known approaches to the study of
nonconventional ergodic averages; see, in particular, Section 3.2 in Meiri [36], the
proof of Lemma 9.2 in Furstenberg and Weiss [28] and Section 6.8 in Ziegler [48].

More generally, a need to extend known machinery for different kinds of ergodic
system to their non-ergodic relatives has been felt in other areas: consider, for
example, Downarowicz' discussion in [17] of 'assignments' for his study of
simplices of invariant measures for topological systems on zero-dimensional metric
spaces, and the work of Fisher, Witte-Morris and Whyte on cocycle superrigidity
for non-ergodic systems [19]. The careful study of such non-ergodic settings also
has many parallels with the analysis of direct integrals in the infinite-dimensional
representation theory of locally compact groups or von Neumann and C*-algebras
(nicely introduced, for instance, in Arveson [1]), although we will not explore this
connection further here.

Notwithstanding the diversity of these previous developments, within structural
ergodic theory the treatment of extensions with varying homogeneous space fibres
seems to have stayed largely immersed in other analyses, such as those cited
above. Although it is intuitively clear that the fundamental structural results for
the ergodic case of homogeneous skew-product extensions should admit natural
generalizations, it seems that this has not yet been carried out. In fact, after setting
up the right definitions we will find that it is largely routine to extend both the
results of Mackey ([34]) on the invariant factor of the extended system and also the
Furstenberg-Zimmer Theory to this setting. We lay out the details of this
generalization in the first part of this paper. More interesting is the extension of the results
of Mentzen [37] on the possible structure of an automorphism of an isometric
extension of ergodic systems: we generalize this by presenting structure theorems for
factors and automorphisms of a relatively ergodic extension by measurably-varying
compact homogeneous spaces.

Although this generalization is as much a matter of care as new ideas, it pays off
by broadening the applicability of the theory of homogeneous skew-products to
settings in which an assumption of base ergodicity is unavailable. This arises, in
particular, when considering an action of a larger group $T : \Gamma \curvearrowright (X, \mu)$ restricted
to some subgroup $\Lambda \le \Gamma$. Although a routine appeal to the ergodic decomposition
can often justify the assumption that $T$ is ergodic overall, if we disintegrate further
to guarantee that the restricted action $T^{\upharpoonright\Lambda}$ is ergodic then $T^\gamma$ for $\gamma \in \Gamma \setminus \Lambda$ need not
preserve the resulting disintegrands of $\mu$. As a consequence, if we are concerned
with how the $\Lambda$-subaction sits within the whole original action, we may be forced
to retain a system for which this subaction of $\Lambda$ is not ergodic.

In this paper we offer two closely-related applications meeting this description. For
both cases we specialize to $\Gamma = \mathbb{Z}^d$. These two applications are relatively simple,
and are included largely to illustrate the arguments made possible by the formalism
described above, but they also exemplify much more general questions on which
we suspect these methods will shed light in the future.

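Given a $\mathbb{Z}^d$-system $\mathbf{X} = (X, \mu, T)$ we can consider the $\sigma$-subalgebra $\Sigma_X^{T\upharpoonright\Gamma}$ of sets
left invariant by the subaction of $T$ corresponding to some subgroup $\Gamma \le \mathbb{Z}^d$. As
is standard in the category of standard Borel spaces, this can be identified up to
$\mu$-negligible sets with the $\sigma$-algebra generated by a factor map $\zeta_0^{T\upharpoonright\Gamma} : \mathbf{X} \to \mathbf{Z}_0^{T\upharpoonright\Gamma}$
to some new system on which the subaction of $\Gamma$ is trivial. Although individually
these new systems can still be quite complicated, a possibly more tractable task is to
describe their possible joint distributions within the original system. If $\Gamma, \Lambda \le \mathbb{Z}^d$
are two subgroups then it is easy to show that $\zeta_0^{T\upharpoonright\Gamma}$ and $\zeta_0^{T\upharpoonright\Lambda}$ are relatively
independent over $\zeta_0^{T\upharpoonright(\Gamma+\Lambda)}$, but for three or more subgroups matters become more
complicated. Clearly given three subgroups $\Gamma_1, \Gamma_2, \Gamma_3 \le \mathbb{Z}^d$ we have that $\zeta_0^{T\upharpoonright\Gamma_1}$ and
$\zeta_0^{T\upharpoonright\Gamma_2}$ both contain $\zeta_0^{T\upharpoonright(\Gamma_1+\Gamma_2)}$, and similarly for other pairs, and so a naive
candidate for a generalization of the above result could be that the three isotropy
factors $\zeta_0^{T\upharpoonright\Gamma_i}$ are relatively independent over the smaller triple of factors
$\zeta_0^{T\upharpoonright(\Gamma_i+\Gamma_j)} \vee \zeta_0^{T\upharpoonright(\Gamma_i+\Gamma_k)}$ (the factor generated by $\zeta_0^{T\upharpoonright(\Gamma_i+\Gamma_j)}$ and
$\zeta_0^{T\upharpoonright(\Gamma_i+\Gamma_k)}$ together) as $(i, j, k)$ ranges over
permutations of $(1, 2, 3)$. If we denote the target $\mathbb{Z}^d$-system of this joint factor map
by $\mathbf{W}_i$ (so this is a joining of $\mathbf{Z}_0^{T\upharpoonright(\Gamma_i+\Gamma_j)}$ and $\mathbf{Z}_0^{T\upharpoonright(\Gamma_i+\Gamma_k)}$) and let $\alpha_i : \mathbf{Z}_0^{T\upharpoonright\Gamma_i} \to \mathbf{W}_i$
be the factor map defined by $\zeta_0^{T\upharpoonright(\Gamma_i+\Gamma_j)} \vee \zeta_0^{T\upharpoonright(\Gamma_i+\Gamma_k)} = \alpha_i \circ \zeta_0^{T\upharpoonright\Gamma_i}$, these factors
are arranged as in the following commutative diagram: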

[Commutative diagram; the surviving labels carry the superscripts $T\upharpoonright(\Gamma_1+\Gamma_2)$, $T\upharpoonright(\Gamma_1+\Gamma_3)$ and $T\upharpoonright(\Gamma_2+\Gamma_3)$.]

In fact the naive conjecture that the factors $\zeta_0^{T\upharpoonright\Gamma_i}$ are relatively independent over
their further factors is false, but 'not by very much': we will see that it can fail
only in a very restricted way. In general, the three factors $\zeta_0^{T\upharpoonright\Gamma_i}$ are relatively
independent over some subextensions of these 'natural candidate' factors $\alpha_i$, and these
subextensions can be coordinatized by measurable compact fibre groups subject to
certain further restrictions.

Here we will examine this when $d = 3$ and $\Gamma_i$ is the cyclic subgroup $\mathbb{Z}e_i$ in the
direction of a basis vector $e_i$, but it seems clear that our methods can be extended
both to more general subgroups of Abelian groups and (probably with considerably
more work) to larger numbers of subgroups.

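Theorem 1.1 (Joint distributions of three isotropy factors). Let $\mathbf{X} = (X, \mu, T)$
be a $\mathbb{Z}^3$-system and write $T_i := T^{e_i}$ for $i = 1, 2, 3$. Let $\mathbf{W}_i$ be the target of the
joint factor map $\alpha_i := \zeta_0^{T_i,T_j} \vee \zeta_0^{T_i,T_k}$, where $\zeta_0^{T_i,T_j} := \zeta_0^{T\upharpoonright(\mathbb{Z}e_i+\mathbb{Z}e_j)}$, let $W_i$ be its
underlying standard Borel space and let $T_j|_{\alpha_i}$ be the restriction of $T_j$ to the factor
$\alpha_i$.

Between the single isotropy factors $\zeta_0^{T_i} : \mathbf{X} \to \mathbf{Z}_0^{T_i}$ and the smaller factors
$\alpha_i : \mathbf{X} \to \mathbf{W}_i$ there are three intermediate factors

$$\mathbf{Z}_0^{T_i} \to \mathbf{V}_i \to \mathbf{W}_i, \quad i = 1, 2, 3,$$

such that

• the triple of factors $\zeta_0^{T_i}$ is relatively independent over the triple of factors
$\mathbf{X} \to \mathbf{V}_i$ under $\mu$;

• there exist compact metrizable group data $G_{i,\bullet}$ on $W_i$ invariant under the
restriction of the whole action $T$ to the factor space $W_i$, a cocycle $\tau_{ij} : W_i \to G_{i,\bullet}$
invariant under the restriction of $T_k$ to $W_i$ and a cocycle $\tau_{ik} : W_i \to G_{i,\bullet}$
invariant under the restriction of $T_j$ to $W_i$ such that we can coordinatize
the extension $\mathbf{V}_i \to \mathbf{W}_i$ as the extension of $\mathbf{W}_i$ by the measurable compact
fibre groups $G_{i,\bullet}$ with the lifted actions defined by

$$T_j : (w, g) \mapsto (T_j|_{\alpha_i}(w), \tau_{ij}(w) \cdot g)$$

and

$$T_k : (w, g) \mapsto (T_k|_{\alpha_i}(w), g \cdot \tau_{ik}(w)).$$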

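We will generally denote a coordinatization of the extension $\mathbf{V}_i \to \mathbf{W}_i$ as above
by the commutative diagram

$$\mathbf{V}_i \;\cong\; \mathbf{W}_i \ltimes (G_{i,\bullet}, m_{G_{i,\bullet}}, \tau_{ij}, \tau_{ik}^{\mathrm{op}}) \;\xrightarrow{\ \text{canonical}\ }\; \mathbf{W}_i$$

where we use the superscript $^{\mathrm{op}}$ to denote a cocycle that acts on fibres by
right-multiplication, and we have suppressed mention of the transformation $T_i$ since by
definition its restriction to $V_i$ is the identity.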

Although our final conclusion here yields measurably-varying fibre groups $G_{i,\bullet}$
that are invariant under the whole action $T$ (and so would be constant if we
had assumed that the overall action $T$ is ergodic), the analysis leading to this
conclusion will go via homogeneous space fibres of possibly greater variability,
for the reason described earlier that at first we will only be able to assume that the
fibres are invariant under the subaction $T_i$.

The same is true of our second application. This is to a special case of the problem
of describing the 'minimal characteristic factors' for the nonconventional ergodic
averages

$$\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{d} f_i \circ T_i^n$$

associated to a $d$-tuple of commuting actions $T_i : \mathbb{Z} \curvearrowright (X, \mu)$ and functions
$f_1, f_2, \ldots, f_d \in L^\infty(\mu)$. Let us write $\mathbf{X} = (X, \mu, T)$ for the $\mathbb{Z}^d$-system given by
these one-dimensional actions in the coordinate directions.

The question of convergence in $L^2(\mu)$ for such averages was first settled when
$d = 2$ by Conze and Lesigne in [13], and since then a number of other works
have addressed other versions or relatives of this question [36, 28, 31], culminating
in Host and Kra's detailed analysis of the case in which $T_i = T^i$ for some fixed
$T$ in [32] (see also Ziegler [48]) and Tao's recent proof in [43] of convergence
for arbitrary $d$. We direct the reader to [4] for a more detailed discussion of this
problem and an alternative proof of convergence.

Here we will consider in the case $d = 2$ an important part of these developments:
the theory of 'characteristic factor-tuples' for such averages. In our setting, a pair
of factors $\xi_i : \mathbf{X} \to \mathbf{Y}_i$ is 'characteristic' if

$$\frac{1}{N} \sum_{n=1}^{N} (f_1 \circ T_1^n) \cdot (f_2 \circ T_2^n) - \frac{1}{N} \sum_{n=1}^{N} \mathsf{E}_\mu(f_1 \circ T_1^n \,|\, \xi_1) \cdot \mathsf{E}_\mu(f_2 \circ T_2^n \,|\, \xi_2) \to 0$$

in $L^2(\mu)$ as $N \to \infty$ for any $f_1, f_2 \in L^\infty(\mu)$. Clearly given such a pair of
factors, the problem of proving convergence reduces to the case when each $f_i$
is $\xi_i$-measurable, and this reduction forms an important first step in many of the
known proofs of convergence. Although characteristic factors are well-understood
in some special cases, the more recent proofs of general convergence in [43, 44, 4]
proceed by first heavily modifying the original system and only then asking after
the characteristic factors (or their finitary analog in Tao's proof in [43]), and so our
knowledge of the characteristic factors of the original system remains incomplete
except in some special cases [36, 32, 20, 48]. More is known in the case $d = 2$
from the work of Conze and Lesigne [13], and in addition the following very
precise description of the characteristic factors when $d = 2$ has achieved folkloric
currency since that work appeared. However, a complete proof seems to be
surprisingly subtle, and we shall give such a proof as our second application of our
non-ergodic machinery for extensions by homogeneous spaces.

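Theorem 1.2 (Characteristic factors for double nonconventional averages). Given
a $\mathbb{Z}^2$-system $\mathbf{X} = (X, \mu, T_1, T_2)$, let $\mathbf{W}_i$ be the target system of the factor
$\alpha_i := \zeta_0^{T_i} \vee \zeta_0^{T_1=T_2}$ with underlying standard Borel space $W_i$. Then $\mathbf{X}$ admits a
characteristic pair of factors $\xi_i : \mathbf{X} \to \mathbf{Y}_i$ with underlying standard Borel spaces $Y_i$ that
extend the factors $\mathbf{X} \to \mathbf{W}_i$ and can be described as follows: there are

• a $T$-invariant measurable family of compact fibre groups $G_\bullet$,

• a cocycle $\sigma : Z_0^{T_1=T_2} \to G_\bullet$ that is ergodic for the restricted
action of $T_1$,

• and a pair of cocycles $\tau_i : Z_0^{T_i} \to G_\bullet$, ergodic for the restricted action of $T_{3-i}$,

such that we can coordinatize these probability spaces as

$$(Y_1, (\xi_1)_\#\mu) \;\cong\; (W_1, (\alpha_1)_\#\mu) \ltimes (G_\bullet, m_{G_\bullet}) \;\xrightarrow{\ \text{canonical map}\ }\; (W_1, (\alpha_1)_\#\mu)$$

so that the restricted actions are given by

$$\text{restriction of } T_1: \quad (w_1, g) \mapsto \big(T_1|_{\alpha_1}(w_1),\ \sigma(\zeta_0^{T_1=T_2}(w_1)) \cdot g\big),$$
$$\text{restriction of } T_2: \quad (w_1, g) \mapsto \big(T_2|_{\alpha_1}(w_1),\ \sigma(\zeta_0^{T_1=T_2}(w_1)) \cdot g \cdot \tau_1(\zeta_0^{T_1}(w_1))\big),$$

and similarly

$$(Y_2, (\xi_2)_\#\mu) \;\cong\; (W_2, (\alpha_2)_\#\mu) \ltimes (G_\bullet, m_{G_\bullet}) \;\xrightarrow{\ \text{canonical map}\ }\; (W_2, (\alpha_2)_\#\mu)$$

with

$$\text{restriction of } T_1: \quad (w_2, g) \mapsto \big(T_1|_{\alpha_2}(w_2),\ \sigma(\zeta_0^{T_1=T_2}(w_2)) \cdot g \cdot \tau_2(\zeta_0^{T_2}(w_2))\big),$$
$$\text{restriction of } T_2: \quad (w_2, g) \mapsto \big(T_2|_{\alpha_2}(w_2),\ \sigma(\zeta_0^{T_1=T_2}(w_2)) \cdot g\big).$$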

We suspect that our methods should extend to offer at least some description of 
characteristic factor-tuples for larger numbers of commuting transformations, al- 
though we also suspect that it will become rapidly more complicated. 

In summary, the body of this paper is organized as follows. 

In Section 2 we recall some definitions and standard results from group theory,
measure theory and ergodic theory that we will need later in the paper, and in
doing so set up some convenient notation.

Section 3 introduces our definitions of measurable families of homogeneous space
data and their direct integrals.

In Section 4 we cover quite briskly the main definitions and results of the
non-ergodic Mackey Theory, and then in Section 5 we treat similarly the non-ergodic
version of the Furstenberg-Zimmer inverse theory.

In Section 6 we pursue a slightly less standard consequence of the Mackey Theory,
using it first to describe the possible factors and groups of automorphisms of an
extension by homogeneous space data, and then translating this into conditions
on an automorphism of a base system that it be liftable to an automorphism of
an extension. This generalizes the classical work of Mentzen [37] in the case of
ergodic systems, and will be important for the applications of the theory that follow.

In Section 7 we present our two applications, to the joint three-fold distributions of
isotropy factors and to double characteristic factors.

Finally, in Section 8 we discuss some further possible applications of this
machinery.

Acknowledgements My thanks go to Vitaly Bergelson, John Griesmer, Bernard 
Host, Keith Kearnes, Bryna Kra, Alexander Leibman, Terence Tao and Tamar 
Ziegler for several helpful discussions and communications, and to the Mathemat- 
ical Sciences Research Institute (Berkeley) 2009 program on Ergodic Theory and 
Additive Combinatorics, during which large parts of this work were completed. 

2 Background and notation 

2.1 Measurable functions and probability kernels 

We will work exclusively in the category of standard Borel probability spaces
$(X, \Sigma_X, \mu)$, and so will often suppress mention of their $\sigma$-algebras.

Any Borel map $\phi : X \to Y$ specifies a $\sigma$-subalgebra of $\Sigma_X$ in the form of
$\phi^{-1}(\Sigma_Y)$. Two such maps $\phi : X \to Y$ and $\psi : X \to Z$ are equivalent if these
$\sigma$-subalgebras of $\Sigma_X$ that they generate are equal up to $\mu$-negligible sets, in which
case we shall write $\phi \simeq \psi$; this clearly defines an equivalence relation among
Borel maps with domain $X$. As is standard, in the category of standard Borel
spaces equivalence classes of such Borel maps are in bijective correspondence with
equivalence classes of $\sigma$-subalgebras under the relation of equality modulo the
$\sigma$-ideal of $\mu$-negligible sets. A treatment of these classical issues may be found, for
example, in Chapter 2 of Glasner [29].

A measure-preserving Borel map $\pi : (X, \mu) \to (Y, \nu)$ contains another such map
$\psi : (X, \mu) \to (Z, \theta)$ if $\pi^{-1}(\Sigma_Y) \supseteq \psi^{-1}(\Sigma_Z)$ up to $\mu$-negligible sets. In this case
we shall write $\pi \succsim \psi$ or $\psi \precsim \pi$, and sometimes that $\psi$ is $\mu$-virtually a function
of $\pi$ or that it is $\mu$-virtually $\pi^{-1}(\Sigma_Y)$-measurable. It is again a classical fact that
in the category of standard Borel spaces this notion of containment is equivalent
to the existence of a factorizing Borel map $\phi : (Y, \nu) \to (Z, \theta)$ with $\psi = \phi \circ \pi$
$\mu$-almost everywhere, and that a measurable analog of the Schroeder-Bernstein
Theorem holds: $\pi \simeq \psi$ if and only if in each direction such a $\phi$ may be chosen
that is invertible away from some negligible subsets of the domain and target. It
is clear that (up to set-theoretic niceties) this defines a partial order on the class of
$\simeq$-equivalence classes of Borel maps out of the given space $(X, \mu)$.

Measure-respecting Borel maps from one probability space to another comprise
the simplest class of morphisms between such spaces, but in this paper we shall
sometimes find ourselves handling also a weaker class of morphisms. Suppose
that $Y$ and $X$ are standard Borel spaces. Then by a probability kernel from $Y$ to
$X$ we understand a function $P : Y \times \Sigma_X \to [0, 1]$ such that

• the map $y \mapsto P(y, A)$ is $\Sigma_Y$-measurable for every $A \in \Sigma_X$;

• the map $A \mapsto P(y, A)$ is a probability measure on $\Sigma_X$ for every $y \in Y$.

Intuitively, such a kernel amounts to a 'randomized map' from $Y$ to $X$: rather than
specify a unique image in $X$ for each point $y \in Y$, it specifies only a probability
distribution $P(y, \,\cdot\,)$ from which a point of $X$ could be chosen. The first of the
above conditions is then the natural sense in which this assignment of a probability
distribution is measurable in $y$; indeed, a popular alternative definition of probability
kernel is as a measurable function from $Y$ to the set $\Pr X$ of Borel probability
measures on $X$. In ergodic theory this notion lies behind that of a 'quasifactor'
(which assumes also a certain equivariance of this map): see, for example, Chapter
8 of Glasner [29], where this alternative convention and notation are used. We will
write $P : Y \stackrel{\mathrm{p}}{\to} X$ when $P$ is a probability kernel from $Y$ to $X$.

Given a kernel $P : Y \stackrel{\mathrm{p}}{\to} X$ and a probability measure $\nu$ on $Y$, we define the
measure $P_\#\nu$ on $X$ by

$$P_\#\nu(A) := \int_Y P(y, A)\, \nu(\mathrm{d}y);$$

this measure on $X$ can be interpreted as the law of a member of $X$ selected
randomly by first selecting a member of $Y$ with law $\nu$ and then selecting a member of
$X$ with law $P(y, \,\cdot\,)$. By analogy with the case of a function between measurable
spaces, we will refer to this as the pushforward of $\nu$ by $P$. This extends standard
deterministic notation: given a measurable function $\phi : Y \to X$, we may associate
to it the deterministic probability kernel given by $P(y, \,\cdot\,) = \delta_{\phi(y)}$ (the point mass
at the image of $y$ under $\phi$), and now $P_\#\nu$ is the usual push-forward measure $\phi_\#\nu$.
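
On finite spaces the pushforward by a kernel reduces to a weighted sum, which the following minimal sketch illustrates. It is not part of the paper: the dictionary representation and the names `pushforward`, `deterministic_kernel`, `nu`, `P` and `phi` are assumptions made for the example.

```python
from fractions import Fraction as F


def pushforward(P, nu):
    """(P_# nu)({x}) = sum over y of nu({y}) * P(y, {x}), on finite spaces."""
    out = {}
    for y, weight in nu.items():
        for x, p in P[y].items():
            out[x] = out.get(x, F(0)) + weight * p
    return out


def deterministic_kernel(phi, points):
    """The kernel P(y, .) = point mass at phi(y) associated to a function phi."""
    return {y: {phi(y): F(1)} for y in points}


def phi(y):
    """A hypothetical function Y -> X used for the deterministic case."""
    return "a" if y == 0 else "b"


# A measure nu on Y = {0, 1, 2} and a kernel P from Y to X = {"a", "b"}.
nu = {0: F(1, 2), 1: F(1, 3), 2: F(1, 6)}
P = {0: {"a": F(1)}, 1: {"a": F(1, 2), "b": F(1, 2)}, 2: {"b": F(1)}}

mu = pushforward(P, nu)                                   # law of the two-step random choice
phi_push = pushforward(deterministic_kernel(phi, nu), nu)  # usual push-forward by phi
```

Exact rational arithmetic is used so that the resulting measures can be compared without rounding.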

Certain special probability kernels naturally serve as adjoints to factor maps, in the 
sense of the following theorem. 

Theorem 2.1. Suppose that $Y$ and $X$ are standard Borel spaces, that $\mu$ is a
probability measure on $X$ and that $\phi : X \to Y$ is a measurable factor map. Then,
denoting the push-forward $\phi_\#\mu$ by $\nu$, there is a $\nu$-almost surely unique probability
kernel $P : Y \stackrel{\mathrm{p}}{\to} X$ such that $\mu = P_\#\nu$ and which represents the conditional
expectation with respect to $\phi$: for any $f \in L^1(\mu)$, the function

$$x_1 \mapsto \int_X f(x)\, P(\phi(x_1), \mathrm{d}x)$$

is a version of the $\mu$-conditional expectation of $f$ with respect to $\phi^{-1}(\Sigma_Y)$.

We also write that this $P$ represents the disintegration of $\mu$ over $\phi$. A general
probability kernel $P : Y \stackrel{\mathrm{p}}{\to} X$ represents the disintegration over $\phi$ of some measure
that pushes forward onto $\nu$ if and only if $\int_A P(y, \,\cdot\,)\, \nu(\mathrm{d}y)$ and $\int_B P(y, \,\cdot\,)\, \nu(\mathrm{d}y)$
are mutually singular whenever $A \cap B = \emptyset$.

Proof See Theorem 6.3 in Kallenberg [33]. □
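
For a finite space the disintegration of Theorem 2.1 is simply conditioning on the fibres of the factor map. The sketch below is an illustration only: the measure `mu`, the map `phi`, the function `f` and all function names are hypothetical, and every fibre is assumed to have positive mass. It computes $\nu = \phi_\#\mu$, the kernel $P$, and the conditional expectation given by the integral formula above.

```python
from fractions import Fraction as F


def disintegrate(mu, phi):
    """Return (nu, P) with nu = phi_# mu and P(y, .) = mu conditioned on phi = y."""
    nu = {}
    for x, weight in mu.items():
        nu[phi(x)] = nu.get(phi(x), F(0)) + weight
    P = {y: {} for y in nu}
    for x, weight in mu.items():
        # Assumes every fibre has positive mass, true in this example.
        P[phi(x)][x] = weight / nu[phi(x)]
    return nu, P


def cond_exp(f, P, phi, x1):
    """The function x1 -> integral of f(x) P(phi(x1), dx)."""
    return sum(f(x) * p for x, p in P[phi(x1)].items())


def phi(x):
    """Hypothetical factor map X = {0,...,5} -> Y = {0, 1}: parity."""
    return x % 2


def f(x):
    """A test function on X."""
    return x


mu = {0: F(1, 12), 1: F(3, 12), 2: F(2, 12), 3: F(2, 12), 4: F(1, 12), 5: F(3, 12)}
nu, P = disintegrate(mu, phi)

# Recover mu as P_# nu, and compare the integral of f with that of E(f | phi).
recovered = {x: nu[phi(x)] * P[phi(x)][x] for x in mu}
integral_f = sum(mu[x] * f(x) for x in mu)
integral_cond = sum(mu[x] * cond_exp(f, P, phi, x) for x in mu)
```

Here the conditional expectation of `f` is constant on each fibre of `phi`, and integrating it against `mu` gives the same value as integrating `f` itself.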
2.2 Systems, subactions and factors 

In this paper we shall spend a great deal of time passing up and down from systems 
to extensions or factors. Moreover, sometimes one system will appear as a factor of 
a 'larger' system in several different ways (most obviously, when we work with a 
system that appears under each coordinate projection from some self-joining). For 
this reason the notational abuse of referring to one system as a factor of another 
but leaving the relevant factor map to the understanding of the reader, although 
popular and useful in modern ergodic theory, seems dangerous here, and we shall 
carefully avoid it. In its place we substitute the alternative abuse, slightly safer in 
our circumstances, of often referring only to the factor maps we use, and leaving 
either their domain or target systems to the reader's understanding. Let us first set 
up some notation to support this practice. 

If $\Gamma$ is a locally compact second countable topological group, by a $\Gamma$-system (or, if
$\Gamma$ is clear, just a system) we understand a jointly measurable probability-preserving
action $T : \Gamma \curvearrowright (X, \mu)$ on a standard Borel probability space. We will often
alternatively denote this space and action by $(X, \mu, T)$, or by a corresponding single
boldface letter such as $\mathbf{X}$. If $\Lambda \le \Gamma$ we denote by $T^{\upharpoonright\Lambda} : \Lambda \curvearrowright (X, \mu)$ the
action defined by $(T^{\upharpoonright\Lambda})^\gamma := T^\gamma$ for $\gamma \in \Lambda$, and refer to this as a subaction, and if
$\mathbf{X} = (X, \mu, T)$ is a $\Gamma$-system we write similarly $\mathbf{X}^{\upharpoonright\Lambda}$ for the system $(X, \mu, T^{\upharpoonright\Lambda})$
and refer to it as a subaction system.

A factor from one system $(X, \mu, T)$ to another $(Y, \nu, S)$ is a Borel map $\pi : X \to Y$
with $\pi_\#\mu = \nu$ and $\pi \circ T^\gamma = S^\gamma \circ \pi$ for all $\gamma \in \Gamma$. Given such a factor, we sometimes
write $T|_\pi$ to denote the action $S$ with which $T$ is intertwined by $\pi$.

In this paper, given a globally invariant $\sigma$-subalgebra in $\mathbf{X}$, a choice of factor
$\pi : \mathbf{X} \to \mathbf{Y}$ generating that $\sigma$-subalgebra will sometimes be referred to as a
coordinatization of the $\sigma$-subalgebra. Importantly for us, some choices of a
coordinatizing factor $\pi$ may reveal some underlying structure more clearly than others, and
so we will sometimes need to pass between coordinatizing factors. Given one
coordinatization $\pi : \mathbf{X} \to \mathbf{Y}$ and an isomorphism $\psi : \mathbf{Y} \to \mathbf{Y}'$, we shall sometimes
refer to the composition $\psi \circ \pi$ as a recoordinatization of $\pi$. We will also extend
this terminology to that of coordinatizations and recoordinatizations of families
of factors of a system in the obvious way in terms of the appropriate commutative
diagram of isomorphisms.

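Given a $\Gamma$-system $\mathbf{X} = (X, \mu, T)$, the $\sigma$-algebra $\Sigma_X^T$ of sets $A \in \Sigma_X$ for which
$\mu(A \triangle T^\gamma(A)) = 0$ for all $\gamma \in \Gamma$ is $T$-invariant, so defines a factor of $\mathbf{X}$. More
generally, if $\Gamma$ is Abelian and $\Lambda \le \Gamma$ then we can consider the $\sigma$-algebra $\Sigma_X^{T\upharpoonright\Lambda}$
generated by all $T^{\upharpoonright\Lambda}$-invariant sets: we refer to this as the $\Lambda$-isotropy factor and
write $\mathbf{Z}_0^{T\upharpoonright\Lambda}$ for some new system that we adopt as the target for a factor map $\zeta_0^{T\upharpoonright\Lambda}$
that generates $\Sigma_X^{T\upharpoonright\Lambda}$, and $Z_0^{T\upharpoonright\Lambda}$ for the standard Borel space underlying $\mathbf{Z}_0^{T\upharpoonright\Lambda}$. Note
that in this case the Abelianness condition (or, more generally, the condition that
$\Lambda \trianglelefteq \Gamma$) is needed for this to be a globally $T$-invariant factor. If $T_1$ and $T_2$ are
two commuting actions of the same Abelian group $\Gamma$ on $(X, \mu)$ then we can define
a third action $T_1T_2^{-1}$ by setting $(T_1T_2^{-1})^\gamma := T_1^\gamma T_2^{\gamma^{-1}}$, and in this case we may
write $\zeta_0^{T_1=T_2} : \mathbf{X} \to \mathbf{Z}_0^{T_1=T_2}$ in place of $\zeta_0^{T_1T_2^{-1}} : \mathbf{X} \to \mathbf{Z}_0^{T_1T_2^{-1}}$. If $S \subseteq \Gamma$ and $\Lambda$
is the group generated by $S$, we will sometimes write $\mathbf{Z}_0^{T\upharpoonright S}$ in place of $\mathbf{Z}_0^{T\upharpoonright\Lambda}$, and
similarly.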
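
In a finite toy model with the uniform measure, the invariant sets of a subaction are exactly the unions of its orbits, so an isotropy factor map can be realized as the map sending a point to its orbit. The sketch below, which is only an illustration under that assumption, takes a hypothetical $\mathbb{Z}^2$-action on $\mathbb{Z}/6 \times \mathbb{Z}/4$ generated by commuting maps `T1` and `T2` and the subgroup $\Lambda = \mathbb{Z}e_1$. It also shows why commutativity matters: `T2` descends to a well-defined map on the space of `T1`-orbits.

```python
from itertools import product

# Hypothetical finite model: X = Z/6 x Z/4 with two commuting translations.
A, B = 6, 4
X = set(product(range(A), range(B)))


def T1(x):
    """Generator of the subaction of Lambda = Z e_1."""
    a, b = x
    return ((a + 2) % A, b)


def T2(x):
    """Generator of the action in the second coordinate direction."""
    a, b = x
    return ((a + 1) % A, (b + 1) % B)


def zeta(x):
    """Isotropy factor map for Lambda: send x to its T1-orbit."""
    orbit = {x}
    y = T1(x)
    while y not in orbit:
        orbit.add(y)
        y = T1(y)
    return frozenset(orbit)


# The target space of the factor map: the set of T1-orbits.
Z0 = {zeta(x) for x in X}


def T2_on_Z0(orbit):
    """Map induced by T2 on orbits; well defined because T1 and T2 commute."""
    return frozenset(T2(x) for x in orbit)
```

The subaction generated by `T1` is trivial on `Z0`, while `T2` still acts there, matching the description of the target of an isotropy factor as a system on which the chosen subgroup acts trivially.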

An important construction of new systems from old is that of relatively
independent products. If $\mathbf{Y} = (Y, \nu, S)$ is some fixed system and $\pi_i : \mathbf{X}_i =
(X_i, \mu_i, T_i) \to \mathbf{Y}$ is an extension of it for $i = 1, 2, \ldots, k$ then we define the
relatively independent product of the systems $\mathbf{X}_i$ over their factor maps $\pi_i$ to be
the system

$$\prod_{\{\pi_1 = \pi_2 = \ldots = \pi_k\}} \mathbf{X}_i = \Big( \prod_{\{\pi_1 = \pi_2 = \ldots = \pi_k\}} X_i,\ \bigotimes_{\{\pi_1 = \pi_2 = \ldots = \pi_k\}} \mu_i,\ T_1 \times T_2 \times \cdots \times T_k \Big)$$

where

$$\prod_{\{\pi_1 = \pi_2 = \ldots = \pi_k\}} X_i := \{(x_1, x_2, \ldots, x_k) \in X_1 \times X_2 \times \cdots \times X_k :\ \pi_1(x_1) = \pi_2(x_2) = \ldots = \pi_k(x_k)\},$$

$$\bigotimes_{\{\pi_1 = \pi_2 = \ldots = \pi_k\}} \mu_i := \int_Y \bigotimes_{i=1}^{k} P_i(y, \,\cdot\,)\, \nu(\mathrm{d}y)$$

and $P_i : Y \stackrel{\mathrm{p}}{\to} X_i$ is a probability kernel representing the disintegration of $\mu_i$ over
$\pi_i$. In case $k = 2$ we will write this instead as $\mathbf{X}_1 \times_{\{\pi_1 = \pi_2\}} \mathbf{X}_2$, and in addition if
$\mathbf{X}_1 = \mathbf{X}_2 = \mathbf{X}$ and $\pi_1 = \pi_2 = \pi$ then we will abbreviate this further to $\mathbf{X} \times_\pi \mathbf{X}$,
and similarly for the individual spaces and measures.
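
For $k = 2$ and finite spaces, the relatively independent product measure can be computed directly from the two disintegrations. The following sketch is illustrative only: the measures `mu1`, `mu2`, the maps `pi1`, `pi2` and the function names are hypothetical, and every fibre is assumed to have positive mass. It builds the measure, which is supported on the fibre product, and its two marginals.

```python
from fractions import Fraction as F


def disintegrate(mu, pi):
    """Return (nu, P) with nu = pi_# mu and P(y, .) = mu conditioned on pi = y."""
    nu = {}
    for x, weight in mu.items():
        nu[pi(x)] = nu.get(pi(x), F(0)) + weight
    P = {y: {} for y in nu}
    for x, weight in mu.items():
        P[pi(x)][x] = weight / nu[pi(x)]  # assumes fibres of positive mass
    return nu, P


def rel_indep_product(mu1, pi1, mu2, pi2):
    """The measure: integral over y of P1(y, .) x P2(y, .) against nu."""
    nu1, P1 = disintegrate(mu1, pi1)
    nu2, P2 = disintegrate(mu2, pi2)
    if nu1 != nu2:
        raise ValueError("both measures must push forward to the same base measure")
    lam = {}
    for y, weight in nu1.items():
        for x1, p1 in P1[y].items():
            for x2, p2 in P2[y].items():
                lam[(x1, x2)] = weight * p1 * p2
    return lam


def pi1(x):
    """Hypothetical factor map X1 = {0, 1, 2, 3} -> Y = {0, 1}."""
    return x % 2


def pi2(x):
    """Hypothetical factor map X2 = {"p", "q", "r"} -> Y = {0, 1}."""
    return {"p": 0, "q": 1, "r": 1}[x]


mu1 = {0: F(1, 8), 1: F(3, 8), 2: F(1, 4), 3: F(1, 4)}
mu2 = {"p": F(3, 8), "q": F(1, 8), "r": F(1, 2)}

lam = rel_indep_product(mu1, pi1, mu2, pi2)

# Marginals of lam on the two coordinates.
marg1 = {x: sum(w for (a, _), w in lam.items() if a == x) for x in mu1}
marg2 = {x: sum(w for (_, b), w in lam.items() if b == x) for x in mu2}
```

The marginals of `lam` recover `mu1` and `mu2`, so the result is a joining of the two spaces in which the coordinates are conditionally independent given their common image in the base.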



2.3 Measurable selectors 

At several points in this paper we need to appeal to some basic results on the 
existence of measurable selectors, often as a means of making rigorous a selection 
of representatives of one or another kind of data above the ergodic components of 
a non-ergodic system. 

Theorem 2.2. Suppose that $(X, \Sigma_X)$ and $(Y, \Sigma_Y)$ are standard Borel spaces, that
$A \subseteq X$ is Borel and that $\pi : X \to Y$ is a Borel surjection. Then the image $\pi(A)$
lies in the $\nu^c$-completion of $\Sigma_Y$ for every Borel probability measure $\nu$ on $(Y, \Sigma_Y)$
with completion $\nu^c$, and for any such $\nu$ there is a map $f : B \to A$ with domain
$B \in \Sigma_Y$ such that $B \subseteq \pi(A)$, $\nu^c(\pi(A) \setminus B) = 0$ and $\pi \circ f = \mathrm{id}_B$.

Proof See, for example, 423O and its consequence 424X(h) in Fremlin [22]. □

Definition 2.3 (Measurable selectors). We refer to a map f as given by the above 
theorem as a measurable selector for the set A. 

Remark We should stress that this is only one of several versions of the 'measurable
selector theorem', due variously to von Neumann, Jankow, Lusin and others.
Note in particular that in some other versions a map $f$ is sought that selects points
of $A$ for strictly all points of $\pi(A)$. In the above generality we cannot guarantee
that a strictly-everywhere selector $f$ is Borel, but only that it is Souslin-analytic
and hence universally measurable (of course, from this the above version follows
at once). On the other hand, if the map $\pi|_A$ is countable-to-one, then a version of
the result due to Lusin does guarantee a strictly-everywhere Borel selector $f$. This
version has already played a significant role in our corner of ergodic theory in the
manipulation of the Conze-Lesigne equations (see, for example, lITBlllSlfTTl ), and
so we should be careful to distinguish it from the above. A thorough account of all
these different results and their proofs can be found in Sections 423, 424 and 433
of Fremlin [22]. $\lhd$

In the right circumstances it is possible to strengthen Theorem 2.2 to obtain a Borel
selector that is invariant under a group of transformations, by making use of a
coordinatization of the invariant factor.

Proposition 2.4. Suppose that $(X, \Sigma_X)$ and $(Y, \Sigma_Y)$ are standard Borel spaces,
$A \subseteq X$ is Borel and $\pi : X \to Y$ is a surjective Borel map, and in addition that
$T : \Gamma \curvearrowright (X, \Sigma_X)$ is a jointly measurable action of a locally compact second
countable group such that $\pi$ is a factor map, so $\pi \circ T^\gamma = S^\gamma \circ \pi$ for some jointly
measurable action $S : \Gamma \curvearrowright (Y, \Sigma_Y)$, and that $A$ is $T$-invariant. Then for any
$S$-invariant probability measure $\nu$ on $(Y, \Sigma_Y)$ with completion $\nu^c$ there are an
$S$-invariant set $B \in \Sigma_Y$ such that $B \subseteq \pi(A)$ and $\nu^c(\pi(A) \setminus B) = 0$ and an
$S$-invariant map $f : B \to A$ such that $\pi \circ f = \mathrm{id}_B$.

Proof Let $f_0 : B_0 \to A$ be an ordinary measurable selector as given by Theorem 2.2,
and let $\nu$ be any $S$-invariant probability measure on $(Y, \Sigma_Y)$. This $B_0$ must
be $\nu$-almost $S$-invariant, simply because $\pi(A)$ is $S$-invariant (as $A$ is $T$-invariant
and $\pi$ is a factor map) and $\nu^c(B_0 \triangle \pi(A)) = 0$.
Using local compactness and second countability, let $(F_i)_{i \geq 1}$ be a countable
compact cover of $\Gamma$, and also let $m_\Gamma$ be a left-invariant Haar measure on $\Gamma$. From
the joint measurability of $S$ it follows that the set

\[
B := \{ y \in Y : m_\Gamma\{\gamma \in \Gamma : S^\gamma(y) \in Y \setminus B_0\} = 0 \}
   = \bigcap_{i \geq 1} \{ y \in Y : m_\Gamma\{\gamma \in F_i : S^\gamma(y) \in B_0\} = m_\Gamma(F_i) \}
\]

is Borel, $S$-invariant and satisfies $\nu(B_0 \triangle B) = 0$.

We now let $\zeta : (Y, \Sigma_Y, \nu) \to (Z, \Sigma_Z, \theta)$ be any coordinatization of the invariant
factor $\Sigma_Y^S$; it is easy to see that this may be chosen so that there exists some $C \in \Sigma_Z$
such that $B = \zeta^{-1}(C)$. We can now use $B$ and $\zeta$ to 'tidy up' our original selector
$f_0$. Indeed, by the $S$-invariance of $\zeta$ and the fact that for every $y \in B$ we have
$S^\gamma(y) \in B_0$ for some (indeed, almost all) $\gamma \in \Gamma$, we must have $\zeta(B_0) \supseteq C$.
Therefore by applying the ordinary Measurable Selector Theorem a second time
we can find a Borel subset $D \in \Sigma_Z$ with $D \subseteq C$ and $\theta(C \setminus D) = 0$ and a Borel
section $\eta : D \to B_0$ such that $\zeta \circ \eta = \mathrm{id}_D$; and so now replacing $B_0$ with $B$ and
the map $f_0$ with $f : y \mapsto f_0(\eta(\zeta(y)))$ completes the proof. □

Definition 2.5 (Invariant measurable selectors). We refer to a map $f$ as given by
the above proposition as a $T$-invariant measurable selector for the set $A$.

2.4 Background from group theory 

We collect here some standard group theoretic definitions and results for future 
reference. 

Definition 2.6 (Core). If $G$ is a group and $H \leq G$ we denote by $\mathrm{Core}_G(H)$ the
core of $H$ in $G$: the largest subgroup of $H$ that is normal in $G$. It is clear that
this exists and equals $\bigcap_{g \in G} g^{-1}Hg$. If $G$ is compact and $H$ is closed then so is
$\mathrm{Core}_G(H)$.

If $\mathrm{Core}_G(H) = \{1_G\}$ we shall write that $H$ is core-free in $G$.
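
As an illustrative aside, the formula for the core can be checked by brute force in a small finite group. The following Python sketch is not part of the formalism of this paper: it represents the elements of $S_3$ as tuples, and the helper names `compose`, `inverse` and `core` are hypothetical choices made only for this example. It computes the intersection of all conjugates of a subgroup directly.

```python
from itertools import permutations

def compose(p, q):
    """Return the permutation p o q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def core(G, H):
    """Intersection of the conjugates g^{-1} H g over all g in G."""
    result = set(H)
    for g in G:
        g_inv = inverse(g)
        result &= {compose(g_inv, compose(h, g)) for h in H}
    return result

S3 = set(permutations(range(3)))
identity = (0, 1, 2)
H = {identity, (1, 0, 2)}               # generated by a single transposition
A3 = {identity, (1, 2, 0), (2, 0, 1)}   # the subgroup of 3-cycles, normal in S3

print(core(S3, H) == {identity})   # H is core-free in S3
print(core(S3, A3) == A3)          # a normal subgroup equals its own core
```

In this toy case the transposition subgroup is core-free, while the normal subgroup of 3-cycles coincides with its core, as the definition predicts.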

Definition 2.7 (Full one-dimensional projections; slices). Given two groups $G_1$
and $G_2$ and a subgroup $M \leq G_1 \times G_2$, and writing $\pi_i : G_1 \times G_2 \to G_i$ for the
two coordinate projections, we say that $M$ has full one-dimensional projections if
$\pi_i(M) = G_i$ for $i = 1,2$.

We refer to the subgroups

\[ H_1 := \pi_1(M \cap (G_1 \times \{1_{G_2}\})) \]

and

\[ H_2 := \pi_2(M \cap (\{1_{G_1}\} \times G_2)) \]

as the first and second slices of $M$ respectively.

It is a classical observation of Goursat (see, for example, Section 1.6 of Schmidt BTI )
that $M$ has full one-dimensional projections and trivial first and second slices if and
only if it is the graph of an isomorphism $\Phi : G_1 \to G_2$. If the slices are non-trivial,
we do at least have the following.

Lemma 2.8. If $M \leq G_1 \times G_2$ has full one-dimensional projections then its slices
satisfy $H_i \unlhd G_i$ for $i = 1,2$.

Proof By symmetry it suffices to treat the case $i = 1$. Let $r_1 \in G_1$. Since
$\pi_1(M) = G_1$ we can find $r_2 \in G_2$ such that $(r_1, r_2) \in M$. It is now easy to check
that

\begin{align*}
r_1 H_1 &= \{g \in G_1 : (r_1^{-1}g, e) \in M\} \\
        &= \{g \in G_1 : (r_1, r_2)(r_1^{-1}g, e) \in M\} \\
        &= \{g \in G_1 : (g, r_2) \in M\} \\
        &= \{g \in G_1 : (g r_1^{-1}, e)(r_1, r_2) \in M\} \\
        &= \{g \in G_1 : (g r_1^{-1}, e) \in M\} = H_1 r_1.
\end{align*}

Since $r_1$ was arbitrary, $H_1$ is normal, as required. □
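
As an illustrative aside, Lemma 2.8 can be checked by brute force in a small example. The following Python sketch is not taken from the paper: the choice $G_1 = G_2 = S_3$, the subgroup $M = \{(a,b) : \mathrm{sgn}(a) = \mathrm{sgn}(b)\}$ and all helper names are assumptions made only for this example. It verifies that $M$ has full one-dimensional projections and that both slices are normal.

```python
from itertools import permutations, product

def compose(p, q):
    """Return the permutation p o q (apply q first, then p)."""
    return tuple(p[q[i]] for i in range(len(q)))

def inverse(p):
    """Return the inverse permutation of p."""
    inv = [0] * len(p)
    for i, image in enumerate(p):
        inv[image] = i
    return tuple(inv)

def sign(p):
    """Sign of a permutation, computed by counting inversions."""
    inversions = sum(1 for i in range(len(p))
                     for j in range(i + 1, len(p)) if p[i] > p[j])
    return -1 if inversions % 2 else 1

def is_normal(H, G):
    """Check that r^{-1} h r lies in H for every r in G and h in H."""
    return all(compose(inverse(r), compose(h, r)) in H for r in G for h in H)

S3 = list(permutations(range(3)))
e = (0, 1, 2)

# Hypothetical example: pairs of permutations with equal sign.
M = {(a, b) for a, b in product(S3, S3) if sign(a) == sign(b)}

# A nonempty finite subset closed under products is a subgroup.
is_subgroup = all((compose(a, c), compose(b, d)) in M
                  for (a, b) in M for (c, d) in M)
full_projections = ({a for a, _ in M} == set(S3)
                    and {b for _, b in M} == set(S3))

H1 = {a for a, b in M if b == e}   # first slice
H2 = {b for a, b in M if a == e}   # second slice

print(is_subgroup, full_projections)
print(sorted(H1), is_normal(H1, S3), is_normal(H2, S3))
```

Here both slices are the subgroup of even permutations, which is normal but non-trivial, so this $M$ is not the graph of an isomorphism.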



Given a compact group $G$ or one of its homogeneous spaces $G/H$ we shall always
consider it endowed with its usual Borel structure and Haar probability measure,
which we shall denote by $m_G$ or $m_{G/H}$.

If $U$ is a compact metrizable group then we write $\mathrm{Clos}\,U$ for its collection of closed
subsets endowed with the Vietoris topology (see, for example, 2.7.20, 3.12.27
and 4.5.23 of Engelking ifTSl : as is standard, this is also compact and metrizable)
and the associated standard Borel structure, and $\mathrm{Lat}\,U \subseteq \mathrm{Clos}\,U$ for the
further Vietoris-closed subfamily of closed subgroups with its induced standard
Borel structure. In this setting of subgroups of compact metrizable groups, the
Vietoris topology is easily seen to coincide with the Fell topology and the Chabauty
topology, both of which also commonly appear in the study of lattices of closed
subgroups; see Subsection 2.1 of Conze and Raugy [16] and the references given
there. This topology and Borel structure can be understood in terms of Haar
measures in the following standard way.

Lemma 2.9. The Vietoris topology and measurable structure on $\mathrm{Lat}\,U$ coincide
with the pullbacks of the vague topology and measurable structure under the
Haar-measure map $H \mapsto m_H$. □



3 Direct integrals of homogeneous-space data 

In this section we give the rigorous definition of a 'direct integral' of
measurably-varying homogeneous spaces and of the lifted transformation acting on it, and
establish some of their elementary properties. We build such an extension $X$ as a
union of different fibres $G_y/K_y$ above each $y \in Y$, the fibre actually depending
only on $\zeta_0^S(y)$, and we extend $S$ to an action $T$ on $X$ using a cocycle
constrained to lie at (almost) every point in the relevant fibre. We enforce a suitable
measurable structure by drawing $G_y$ and $K_y$ from among the compact subgroups
of some fixed 'repository' group, subject to the condition $K_y \leq G_y$, measurably
for the Vietoris measurable structure on such subgroups.

Definition 3.1 (Measurable homogeneous space data). Let $Y$ be a standard Borel
space and $U$ a fixed compact metrizable group. By measurable compact group
data on $Y$ with fibre repository $U$ we understand a map $Y \to \mathrm{Lat}\,U : y \mapsto G_y$ that
is measurable for the Vietoris Borel structure on $\mathrm{Lat}\,U$. We shall usually denote
such a map by $G_\bullet$, and will often omit explicit mention of the fibre repository $U$.
More generally, by measurable compact homogeneous space data on $Y$ with fibre
repository $U$ we understand a pair $(G_\bullet, K_\bullet)$ of measurable compact group data
with repository $U$ such that $K_y \leq G_y$ for every $y$. We shall usually denote this
pair instead by $G_\bullet/K_\bullet$, and think of it as a measurable assignment of the compact
homogeneous space $G_y/K_y$ to each point $y \in Y$.

Definition 3.2 (Direct integral of measurable homogeneous space data). Given a
standard Borel probability space $(Y, \nu)$ and measurable compact homogeneous
space data as above, we shall define their direct integral to be the subset

\[ \{(y, gK_y) : y \in Y,\ g \in G_y\} \subseteq Y \times \mathrm{Clos}\,U, \]

which we denote by $Y \ltimes G_\bullet/K_\bullet$. This is easily verified to be standard Borel for the
relevant product measurable structure, and we will always assume it to be endowed
with the restriction of that measurable structure.

On this space we define the direct integral measure $\nu \ltimes m_{G_\bullet/K_\bullet}$ by

\[ \nu \ltimes m_{G_\bullet/K_\bullet}(A) := \int_Y \delta_y \otimes m_{G_y/K_y}\big(A \cap (\{y\} \times G_y/K_y)\big)\, \nu(dy). \]

Given another measurable assignment of subgroup data $H_\bullet \leq G_\bullet$ we define
analogously the direct integrals $(Y \ltimes H_\bullet\backslash G_\bullet, \nu \ltimes m_{H_\bullet\backslash G_\bullet})$ of the spaces of right-cosets
and $(Y \ltimes (H_\bullet\backslash G_\bullet/K_\bullet), \nu \ltimes m_{H_\bullet\backslash G_\bullet/K_\bullet})$ of the spaces of double cosets.

Remark We will rarely remark again on the assumption that the fibre repository
$U$ be metrizable, but this will always be implicit. This ensures that the above
construction keeps us within the category of standard Borel spaces (and it will be a
natural consequence of the non-ergodic Furstenberg-Zimmer theory applied to such
spaces), and will occasionally be important for proofs (such as in Lemma 3.7
below). One could attempt to construct an extended theory that allows non-metrizable
fibre groups and works instead in the larger category of perfect measure spaces (see
342K of Fremlin tHJ), but we will not do so here. ◁
Definition 3.3 (Cocycle-sections). Suppose that $(Y, \nu)$, $U$ and $G_\bullet/K_\bullet$ are as above,
that $\Gamma$ is a locally compact second countable group and that $S : \Gamma \curvearrowright (Y, \nu)$, and
suppose further that the group data $y \mapsto G_y$ and $y \mapsto K_y$ are $S$-invariant. Then
a cocycle-section of $G_\bullet$ over $S$ is a measurable cocycle $\rho : \Gamma \times Y \to U$ over
$S$ such that $\rho(\gamma, y) \in G_y$ for every $\gamma \in \Gamma$ and $y \in Y$. We shall denote such a
cocycle-section by $\rho : \Gamma \times Y \to G_\bullet$.

Remark Note that in the setting of a general locally compact second countable
group $\Gamma$, the definition that $\rho$ be a cocycle over $S$ demands only that
$\rho(\gamma_1\gamma_2, y) = \rho(\gamma_1, S^{\gamma_2}y) \cdot \rho(\gamma_2, y)$ for $\nu$-almost every $y \in Y$ for strictly every
$\gamma_1$ and $\gamma_2$ (see, for instance, Section 4.2 of Zimmer |3T|), where the negligible set
of 'bad' $y$ is allowed to vary with $(\gamma_1, \gamma_2)$; and that by convention two cocycles
are equivalent if they agree $\nu$-almost surely for strictly every $\gamma$. In view of this,
we lose no generality in asking that $\rho(\gamma, y) \in G_y$ for strictly every $y$ and $\gamma$, rather
than for almost every $y$ for strictly every $\gamma$, since in the latter case we may simply
adjust $\rho$ to equal $1$ on the Borel set where it falls outside the specified repository,
and this changes each $\rho(\gamma, \cdot\,)$ on only a $\nu$-negligible set for strictly every $\gamma$. ◁

Finally, we can define our class of extensions. 

Definition 3.4 (Extensions by measurable homogeneous space data). Suppose that
$\mathbf{Y} = (Y, \nu, S)$ and $G_\bullet/K_\bullet$ are as above, that the group data $y \mapsto G_y$ and $y \mapsto K_y$
are $S$-invariant and that $\rho : \Gamma \times Y \to G_\bullet$ is a cocycle-section over $S$. Then the
extension of $\mathbf{Y}$ by the data $(G_\bullet/K_\bullet, \rho)$ is the action $T$ of $\Gamma$ on
$(Y \ltimes G_\bullet/K_\bullet, \nu \ltimes m_{G_\bullet/K_\bullet})$ given by

\[ T^\gamma(y, gK_y) := (S^\gamma y, \rho(\gamma, y)gK_y); \]

it is routine to verify that this is measurable and measure-preserving.

We will often denote this extended system by $\mathbf{Y} \ltimes (G_\bullet/K_\bullet, m_{G_\bullet/K_\bullet}, \rho)$. It clearly
admits $\mathbf{Y}$ as a factor simply by projecting out the fibre coordinate; we will refer
to this as the canonical factor map. The data $G_\bullet/K_\bullet$ and cocycle-section $\rho$ are
together relatively ergodic if the extension $\mathbf{Y} \ltimes (G_\bullet/K_\bullet, m_{G_\bullet/K_\bullet}, \rho) \to \mathbf{Y}$ through
the canonical map is relatively ergodic.
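
As an illustrative aside, Definitions 3.2 and 3.4 can be made concrete in a finite toy model. The following Python sketch is a set of assumptions rather than anything taken from the paper: the acting group is $\Gamma = \mathbb{Z}$ (so the action is generated by one map), the repository is the cyclic group $\mathbb{Z}/12$ written additively, the base has two $S$-orbits carrying different fibre groups, $K_\bullet$ is trivial, and all variable names are hypothetical. It checks that the skew-product map is a bijection of the direct integral and preserves the direct integral measure.

```python
from fractions import Fraction

U_ORDER = 12                              # repository U = Z/12, written additively
S = {0: 1, 1: 0, 2: 3, 3: 4, 4: 2}        # base map with two orbits: {0,1} and {2,3,4}
nu = {y: Fraction(1, 5) for y in S}       # S-invariant probability measure on Y

Z4 = frozenset({0, 3, 6, 9})              # copy of Z/4 inside Z/12
Z3 = frozenset({0, 4, 8})                 # copy of Z/3 inside Z/12
G = {0: Z4, 1: Z4, 2: Z3, 3: Z3, 4: Z3}   # S-invariant group data y -> G_y
rho = {0: 6, 1: 0, 2: 4, 3: 0, 4: 0}      # generator of the cocycle, rho[y] in G_y

# Direct integral Y x| G_. and its measure nu x| m_{G_.}
X = [(y, g) for y in S for g in sorted(G[y])]
mu = {(y, g): nu[y] / len(G[y]) for (y, g) in X}

def T(point):
    """Skew-product map (y, g) -> (S y, rho(y) + g)."""
    y, g = point
    return (S[y], (rho[y] + g) % U_ORDER)

group_data_invariant = all(G[S[y]] == G[y] for y in S)
cocycle_in_fibres = all(rho[y] in G[y] for y in S)
is_bijection = sorted(T(x) for x in X) == sorted(X)
is_measure_preserving = all(mu[T(x)] == mu[x] for x in X)

print(group_data_invariant, cocycle_in_fibres)
print(is_bijection, is_measure_preserving, sum(mu.values()))
```

Since the map is a bijection of a finite set and leaves the weight of every point unchanged, the push-forward of the direct integral measure equals itself, which is the finite analogue of the measure-preservation asserted in Definition 3.4.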

Remarks 1. In light of the Peter-Weyl Theorem (treated in most standard texts
on compact group representations, such as in Section III.3 of Bröcker and tom
Dieck [12]) all compact metrizable groups can be realized isomorphically, albeit
highly non-uniquely, as closed subgroups of a suitably large direct product of
unitary groups, say $U := \prod_{n \geq 1} \mathrm{U}(n)^{\mathbb{N}}$. This suggests that such a direct product
should suffice as a compact repository for all purposes, and indeed this can be
proved with just a little work; however, this result seems to contribute little to the
theory, and so we will not present it here. Note, however, that it is also precisely
such direct products of unitary groups that will emerge naturally as repositories in
the non-ergodic Furstenberg-Zimmer inverse theory of Section 5 below.

2. In view of the condition that $G_\bullet$ and $K_\bullet$ are $S$-invariant, given a
coordinatization $\zeta_0^S : Y \to Z_0^S$ of the $S$-isotropy factor we could alternatively work with
compact measurable group data defined initially as functions on the space $Z_0^S$ and
then lifted through $\zeta_0^S$. We will occasionally use this alternative description when
it is notationally convenient. ◁

The following related definition will also occasionally be useful. 

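Definition 3.5 (Opposite extensions by measurable group data). If $\mathbf{Y}$, $G_\bullet$ and $\rho$
are as above, then they also define an extended action $T$ on $(Y \ltimes G_\bullet, \nu \ltimes m_{G_\bullet})$ by

\[ T^\gamma(y, g) := (S^\gamma y, g\rho(\gamma, y)^{-1}); \]

this is the opposite extension of $\mathbf{Y}$ by the data $(G_\bullet, \rho)$, and we will denote this $T$
by $S \ltimes \rho^{\mathrm{op}}$.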

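Remark In fact we always have

\[ \mathbf{Y} \ltimes (G_\bullet, \rho) \cong \mathbf{Y} \ltimes (G_\bullet, \rho^{\mathrm{op}}) \]

as extensions of $\mathbf{Y}$ (that is, compatibly with the two canonical factor maps to $\mathbf{Y}$)
through the fibrewise isomorphism $(y,g) \mapsto (y,g^{-1})$. The use of opposite
extensions will matter to us in situations where we have two different actions on the
extended space, one by a cocycle and one by an opposite cocycle. ◁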

Before leaving this section, it is worth noting one way in which some redundancy 
in the above definition can be removed. 

Definition 3.6. Homogeneous space data $G_\bullet/K_\bullet$ over $(Y, \nu)$ is core-free if $K_y$ is
core-free in $G_y$ almost everywhere.

Lemma 3.7. Suppose $\mathbf{Y} := (Y, \nu, S)$ is a $\Gamma$-system, that $G_\bullet/K_\bullet$ are measurable
$S$-invariant homogeneous space data on $Y$ with repository $U$ and that
$\rho : \Gamma \times Y \to G_\bullet$ is a cocycle-section over $S$. If in addition the group $K_y$ is
normal in $G_y$ for $\nu$-almost every $y$, then there are a fibre repository $U'$, measurable
$S$-invariant group data $G'_\bullet$ on $Y$ and a measurable $S$-invariant family of
isomorphisms $\Psi_y : G_y/K_y \to G'_y$ such that the map $(y, gK_y) \mapsto (y, \Psi_y(gK_y))$ defines
an isomorphism of extensions

\[
\begin{array}{ccc}
\mathbf{Y} \ltimes (G_\bullet/K_\bullet, m_{G_\bullet/K_\bullet}, \rho) & \stackrel{\cong}{\longleftrightarrow} & \mathbf{Y} \ltimes (G'_\bullet, m_{G'_\bullet}, \rho') \\
 & \searrow \qquad \swarrow & \\
 & \mathbf{Y} &
\end{array}
\]

with $(\gamma, y) \mapsto \rho'(\gamma, y) := \Psi_y(\rho(\gamma, y)) : \Gamma \times Y \to G'_\bullet$.

Proof This rests on the construction of the new fibre repository for the quotient
groups $G_\bullet/K_\bullet$. For $y \in Y$ let $\mathfrak{H}_y \leq L^2(m_U)$ be the separable Hilbert subspace of
square-integrable functions on $U$ invariant under left-rotation by $K_y$. This is an
$S$-invariant measurable family of separable Hilbert spaces (in the sense familiar from
the analysis of group representations and von Neumann algebras; see, for instance,
Mackey [35]), and so we can partition $Y$ into $S$-invariant measurable subsets $A_1$,
$A_2$, \ldots, $A_\infty$, and for each $n \in \mathbb{N} \cup \{\infty\}$ select an $S$-invariant measurable family of
isomorphisms $\Phi_y : \mathfrak{H}_y \to \mathfrak{H}'_n$ for $y \in A_n$, where $\mathfrak{H}'_n$ is some fixed $n$-dimensional
reference complex Hilbert space and $\mathfrak{H}'_1 \leq \mathfrak{H}'_2 \leq \cdots \leq \mathfrak{H}'_\infty$.

Now let $\pi_y : G_y \curvearrowright \mathfrak{H}'_\infty$ for $y \in A_n$ be the representation that results from first
restricting the left-regular representation of $G_y$ on $L^2(m_U)$ to $\mathfrak{H}_y$ (which is possible
when $K_\bullet \unlhd G_\bullet$, hence almost everywhere), then composing with $\Phi_y$ to obtain a
representation on $\mathfrak{H}'_n$ and finally extending this to act on $\mathfrak{H}'_\infty$ by acting trivially on
$\mathfrak{H}'_\infty \ominus \mathfrak{H}'_n$.

This defines an $S$-invariant measurable family of representations $\pi_y$ of $G_y$ for
$y \in Y$ outside some $\nu$-negligible subset, and such that $K_y = \ker\pi_y$ almost
surely. Next, it is easy to see that the decomposition of $\pi_y$ into finite-dimensional
representations given by the Peter-Weyl Theorem is measurable in $y$ (for example,
since they may be recovered as the spectral projections of each of a countable dense
subfamily of all the measurable selections over $y$ of $\pi_y$-invariant compact
operators on $\mathfrak{H}'_\infty$). Hence this decomposition gives a measurable family of continuous
homomorphic embeddings $G_\bullet \to \prod_{n \geq 1} \mathrm{U}(n)^{\mathbb{N}}$ with kernels $K_\bullet$, and so letting
$G'_\bullet$ be the image group data of these embeddings they define a measurable family
of isomorphisms $\Psi_\bullet$ such that defining $\rho'$ as above and applying $\Psi_\bullet$ fibrewise on
$Y \ltimes G_\bullet/K_\bullet$ gives the desired isomorphism of extensions. □

Corollary 3.8. If $\mathbf{X} = \mathbf{Y} \ltimes (G_\bullet/K_\bullet, m_{G_\bullet/K_\bullet}, \rho)$ is an extension by homogeneous
space data, then it is isomorphic (as an extension of $\mathbf{Y}$ through the canonical map)
to an extension by core-free homogeneous space data.

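Proof Let $U$ be the repository and let $L_y := \bigcap_{g \in G_y} g^{-1}K_y g$ be the pointwise
core of $K_y$ in $G_y$. First observe that for any $u \in U$ the set

\[ \Big\{(G,K) : u \in \bigcap_{g \in G} g^{-1}Kg\Big\} = \{(G,K) : K \ni gug^{-1}\ \forall g \in G\} \]

is closed in $(\mathrm{Lat}\,U)^2$, since if $(G, K)$ does not lie in this set then there are a closed
set $V_1 \subseteq U$ with nonempty interior and an open set $V_2 \subseteq U$ such that $K \cap V_1 = \emptyset$,
$G \cap V_2 \neq \emptyset$ and $V_2 u V_2^{-1} \subseteq V_1$, and these conditions persist on a Vietoris-open
neighbourhood of $(G, K)$. It follows that the map
$(\mathrm{Lat}\,U)^2 \to \mathrm{Lat}\,U : (G,K) \mapsto \bigcap_{g \in G} g^{-1}Kg$ is measurable, and hence that $L_\bullet$ is
measurable group data.

Now by the preceding lemma we can select a measurable family of embeddings of
the groups $G_y/L_y$ into a suitably-modified repository to obtain an isomorphism of
systems

\[
\begin{array}{ccc}
\mathbf{Y} \ltimes (G_\bullet/L_\bullet, m_{G_\bullet/L_\bullet}, \rho) & \stackrel{\cong}{\longleftrightarrow} & \mathbf{Y} \ltimes (G'_\bullet, m_{G'_\bullet}, \rho') \\
 & {\scriptstyle\text{canonical}} \searrow \qquad \swarrow {\scriptstyle\text{canonical}} & \\
 & \mathbf{Y} &
\end{array}
\]

corresponding to a continuous group isomorphism $G_y/L_y \cong G'_y$ at almost every
$y$. Under these isomorphisms the subgroups $K_y \leq G_y$ correspond measurably
to some $K'_y \leq G'_y$ so that $(G'_y, K'_y) \cong (G_y/L_y, K_yL_y/L_y)$, and so observing
from its definition that $K_yL_y/L_y$ is always core-free in $G_y/L_y$, this completes the
proof. □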



4 Mackey Theory in the non-ergodic setting 

We will now move on to a more detailed analysis of extensions by homogeneous
space data, and more specifically of their invariant factors and relatively ergodic
measures. Many of the ideas that follow are nearly direct translates to our setting
of those of Mackey in the case of an ergodic base system, and we will follow quite
closely their treatment in Section 3.5 of Glasner [29].

In fact, more is true: earlier work on multiple recurrence and nonconventional
ergodic averages has already encountered the possibility of a measurably-varying
Mackey group within an extension of a non-ergodic base system by a fixed overall
group. This technicality arises in the work of Meiri [36] on correlation sequences
arising from probability-preserving systems, of Furstenberg and Weiss [28] on
certain polynomial nonconventional ergodic averages and more recently in Ziegler's
approach in [48] to convergence of linear nonconventional averages for powers of
a single transformation. For example, during the analysis in [28] a homogeneous
skew-product extension of ergodic systems $(X, \mu, T) = (Y, \nu, S) \ltimes (G, m_G, \rho)$ is
three-fold joined to itself, to give a measure on $X^3$ that is invariant for a
transformation acting by a power of $T$ on each of the three coordinates, but which is not
ergodic for that transformation. This system is now coordinatized as an extension
of an action on $Y^3$ by the group $G^3$ and a cocycle assembled coordinatewise from
$\rho$, but since the base is no longer ergodic the description of the ergodic components
of the overall system requires the possibility that the Mackey group can vary among
the closed subgroups of $G^3$ (a possibility that is then discounted by an argument
showing that they are all actually conjugate, and so may in fact be taken to be
constant; we shall see a similar trick in Subsection 7.4 below).

The only extra subtlety for which we must allow here is that the overall group $G_\bullet$
now also varies measurably. This will require us to work rather harder in setting up
the proof, although the overall idea is very similar to those mentioned above. For
this reason, although we have included complete proofs here, we refer the reader to
these other sources, and also Section 3.5 of Glasner [29], for relevant background.

4.1 Ergodic decompositions and Mackey group data 

The Mackey Theory describes the invariant factor of an extension
$\mathbf{X} = \mathbf{Y} \ltimes (G_\bullet, m_{G_\bullet}, \rho)$ in terms of the invariant factor of $\mathbf{Y}$ and the data
$(G_\bullet, m_{G_\bullet}, \rho)$ of the extension. Here it will prove convenient to treat $G_\bullet$ as varying
over the factor space $Z_0^S$ of ergodic components, lifted to $Y$ for the purpose of
defining the extended system.

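Theorem 4.1 (Mackey Theorem in the non-ergodic case). Suppose that $\mathbf{X}$ is the
group-data extension $\mathbf{Y} \ltimes (G_\bullet, m_{G_\bullet}, \rho)$ and that $\zeta_0^S : Y \to Z_0^S$ is a
coordinatization of the base isotropy factor, and let $\pi : X = Y \ltimes G_\bullet \to Y$ and
$\theta : Z_0^S \ltimes G_\bullet \to Z_0^S$ be the canonical factor maps. Then there are measurable
subgroup data $K_\bullet \leq G_\bullet$ on $Z_0^S$, a $T$-invariant map

\[ \phi : (X, \mu) \to (Z_0^S, (\zeta_0^S)_\#\nu) \ltimes (K_\bullet\backslash G_\bullet, m_{K_\bullet\backslash G_\bullet}) \]

and a section $b : Y \to G_\bullet$ such that

(1) $\phi$ coordinatizes $Z_0^T$ and the following diagram commutes:

\[
\begin{array}{ccc}
Y \ltimes G_\bullet & \stackrel{\phi}{\longrightarrow} & Z_0^S \ltimes K_\bullet\backslash G_\bullet \\
{\scriptstyle\text{canonical}}\downarrow & & \downarrow{\scriptstyle\text{canonical}} \\
Y & \stackrel{\zeta_0^S}{\longrightarrow} & Z_0^S
\end{array}
\]

(2) $\phi(y,g) = (\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)g)$ for $\mu$-almost every $(y,g)$;

(3) the cocycle-section $(\gamma, y) \mapsto b(S^\gamma y)\rho(\gamma, y)b(y)^{-1}$ takes a value in $K_{\zeta_0^S(y)}$
for $\nu$-almost every $y$ for every $\gamma$;

(4) (Conjugate minimality) if $K'_\bullet \leq G_\bullet$ is another measurable assignment of
compact subgroup data on $Z_0^S$ and $b' : Y \to G_\bullet$ another section such that
the cocycle-section $(\gamma, y) \mapsto b'(S^\gamma y)\rho(\gamma, y)b'(y)^{-1}$ takes a value in $K'_{\zeta_0^S(y)}$
for $\nu$-almost every $y$ for every $\gamma$, then there is a section $c : Z_0^S \to G_\bullet$ such
that

\[ c(s) \cdot K'_s \cdot c(s)^{-1} \geq K_s \]

for $(\zeta_0^S)_\#\nu$-almost every $s$;

(5) if $P : Z_0^S \stackrel{\mathrm{p}}{\to} Y$ is a version of the disintegration of $\nu$ over $\zeta_0^S$, then the
probability kernel $(s, K_s g') \mapsto P(s, \cdot\,) \ltimes m_{b(\bullet)^{-1}K_s g'}$ is a version of the
disintegration of $\mu$ over $\phi$.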

Remark Clearly with hindsight we can take the property (2) above as defining
$\phi$; the point, however, is that we will obtain $K_\bullet$ and $\phi$ first and then show that $\phi$
takes this form for some $b$. ◁

The proof of this theorem will require some initial constructions and an enabling 
lemma concerning the measurable selection of generic points. 

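First let $\zeta_0^T : X \to Z_0^T$ be some coordinatization of the isotropy factor of the
large system with the property that $\zeta_0^S \circ \pi$ factorizes through the natural factor
map $\xi : Z_0^T \to Z_0^S$ in the sense of the commutative diagram

\[
\begin{array}{ccc}
X & \stackrel{\zeta_0^T}{\longrightarrow} & Z_0^T \\
{\scriptstyle\pi}\downarrow & & \downarrow{\scriptstyle\xi} \\
Y & \stackrel{\zeta_0^S}{\longrightarrow} & Z_0^S
\end{array}
\]

(it is a standard fact that this is possible; see, for example, Section 2.2 of
Glasner [29]).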

We need a formal way to work with the action by right-multiplication of the fibre
$G_y$ on itself (the difficulty being, of course, that global constructions based on this
pointwise-varying action need to be kept measurable). To this end we define the map

\[ \tau_0 : X \times_{\{\zeta_0^S \circ \pi = \theta\}} (Z_0^S \ltimes G_\bullet) \to X : ((y,g'), (s,g)) \mapsto (y, g'g); \]

intuitively, $\tau_0(x, (s,g))$ gives the image of $x$ under right-multiplication by $g$, which
is an element of the group $G_s$ over $s = \zeta_0^S(\pi(x))$ that acts on the fibre above $\pi(x)$.
This map $\tau_0$ can be well-defined only for those tuples with $s = \zeta_0^S(\pi(x))$, hence
the need for the relative self-product in the specification of its domain; however,
with this restriction in place it is easily seen to be measurable.

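We can now work with the domain of $\tau_0$ as a system in its own right under the
action $(T \times \mathrm{id}_{Z_0^S \ltimes G_\bullet})$. It is clear that the measure
$\mu \otimes_{\{\zeta_0^S \circ \pi = \theta\}} ((\zeta_0^S)_\#\nu \ltimes m_{G_\bullet})$ is $(T \times \mathrm{id}_{Z_0^S \ltimes G_\bullet})$-invariant and that $\tau_0$ itself
is a factor map from the resulting system onto $(X, \mu, T)$.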

From $\tau_0$ we now define the composition

\[ \tau : X \times_{\{\zeta_0^S \circ \pi = \theta\}} (Z_0^S \ltimes G_\bullet) \stackrel{\tau_0}{\longrightarrow} X \stackrel{\zeta_0^T}{\longrightarrow} Z_0^T. \]

Heuristically this assigns to the pair $(x, (s,g))$ the ergodic component of $(X, \mu, T)$
that contains the image of $x$ under the right-multiplication by $g$ acting on its
fibre. Of course, the whole point is that different points within a single fibre of $\pi$
will generally lie in different ergodic components, and this map $\tau$ reports on this
dependence.

Informally, the proof of Theorem 4.1 now proceeds by selecting a representative
$p(s) \in X$ above each $s \in Z_0^S$ and then defining $K_s$ to be the subgroup of those
$g \in G_s$ such that the $T$-ergodic component of $p(s)$ does not change upon
right-multiplication by $g$ inside the $\pi$-fibre of $p(s)$: that is, such that
$\tau(p(s), (s,g)) = \zeta_0^T(p(s))$. We need $p$ to select points that are sufficiently 'generic' in the fibres
above $s$, in the sense made precise by the following lemma.
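
As an illustrative aside, the informal recipe above can be carried out by hand in a finite toy model, where every point is 'generic' and ergodic components are simply orbits. The following Python sketch is built entirely on assumptions of ours, not on the paper: $\Gamma = \mathbb{Z}$, the repository is the abelian group $\mathbb{Z}/12$ (so right- and left-multiplication agree), each base orbit is labelled by its smallest element, and all names are hypothetical. It computes $K_s$ as the set of $g \in G_s$ that keep the representative $p(s) = (s, 0)$ inside its own $T$-orbit.

```python
U_ORDER = 12                              # repository U = Z/12, written additively
S = {0: 1, 1: 0, 2: 3, 3: 4, 4: 2}        # base map with two orbits: {0,1} and {2,3,4}
Z4 = frozenset({0, 3, 6, 9})              # copy of Z/4 inside Z/12
Z3 = frozenset({0, 4, 8})                 # copy of Z/3 inside Z/12
G = {0: Z4, 1: Z4, 2: Z3, 3: Z3, 4: Z3}   # S-invariant group data y -> G_y
rho = {0: 6, 1: 0, 2: 4, 3: 0, 4: 0}      # generator of the cocycle, rho[y] in G_y

def T(point):
    """Skew-product map (y, g) -> (S y, rho(y) + g)."""
    y, g = point
    return (S[y], (rho[y] + g) % U_ORDER)

def orbit(point, step):
    """Forward orbit of a point under a bijection of a finite set."""
    seen = set()
    while point not in seen:
        seen.add(point)
        point = step(point)
    return seen

# zeta plays the role of the coordinatization of the base isotropy factor:
# it labels each base point by the smallest element of its S-orbit.
zeta = {y: min(orbit(y, S.get)) for y in S}

mackey = {}             # s -> K_s
components_above = {}   # s -> number of T-orbits lying above the base orbit s
for s in sorted(set(zeta.values())):
    representative = (s, 0)                       # p(s), a point above s
    component = orbit(representative, T)
    # g lies in K_s when translating p(s) by g stays in the same T-orbit
    mackey[s] = {g for g in G[s] if (s, g) in component}
    fibre_points = [(y, g) for y in S if zeta[y] == s for g in G[y]]
    components_above[s] = len({frozenset(orbit(x, T)) for x in fibre_points})

print(mackey)
print(components_above)
```

In this toy model the Mackey group is a proper subgroup of the fibre group above one base orbit and the whole fibre group above the other, so it genuinely varies with $s$; in each case the number of ergodic components above $s$ equals the index of $K_s$ in $G_s$, matching the coordinatization by $K_\bullet\backslash G_\bullet$ in Theorem 4.1.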

Lemma 4.2. In the setting of Theorem 4.1 we can find a Borel measurable section
$p : Z_0^S \to X$ of the factor map $\zeta_0^S \circ \pi$ such that the probability kernel

\[ P(s, A) := m_{G_s}\{g \in G_s : \tau(p(s), (s,g)) \in A\}, \qquad A \in \Sigma_{Z_0^T}, \]

is a version of the disintegration of $(\zeta_0^T)_\#\mu$ over $\xi : Z_0^T \to Z_0^S$.
Proof We first define a probability kernel $P' : X \stackrel{\mathrm{p}}{\to} Z_0^T$ by

\[ P'((y,g'), A) := m_{G_{\zeta_0^S(y)}}\{g \in G_{\zeta_0^S(y)} : \tau((y,g'), (\zeta_0^S(y), g)) \in A\}. \]

Intuitively this takes a point $x = (y, g')$, replaces it with an average over the fibre
$\pi^{-1}\{y\}$, regarded as a copy of the fibre group $G_{\zeta_0^S(y)}$, and then pushes the resulting
probability measure down to $Z_0^T$.

It is easy to see that this satisfies the measurability conditions of a probability
kernel, since we have

\[ P'((y,g'), A) = P''((y,g'), (\zeta_0^T)^{-1}(A)) \]

where $P'' : X \stackrel{\mathrm{p}}{\to} X$ is the manifestly measurable probability kernel

\[ P''((y,g'), \cdot\,) := (\delta_y \otimes m_{G_{\zeta_0^S(y)}})(\,\cdot\,). \]

In addition the above definition implies that

\[ P'((y,g'), \cdot\,) = P'((y,g'g), \cdot\,) = P'((S^\gamma y, \rho(\gamma, y)g'), \cdot\,) \]

for any $g \in G_{\zeta_0^S(y)}$ and $\gamma \in \Gamma$, firstly in view of the averaging over $G_{\zeta_0^S(y)}$ and
secondly because we take the image under the invariant function $\zeta_0^T$. Therefore
the function $x \mapsto P'(x, \cdot\,)$, regarded as a Borel map from $X$ to the space of Borel
probability measures on $Z_0^T$ with its usual Borel structure, both factorizes through
$\pi$ and is then $S$-invariant. Therefore there is some $P : Z_0^S \stackrel{\mathrm{p}}{\to} Z_0^T$ such that
$P' = P \circ \zeta_0^S \circ \pi$, $\mu$-almost surely. This $P$ must be a version of the disintegration
of $(\zeta_0^T)_\#\mu$ over $\xi$.

Setting

\[ A_s := \{x \in X : \zeta_0^S(\pi(x)) = s \text{ and } P'(x, \cdot\,) = P(s, \cdot\,)\}, \]

this is now the set of 'generic points' above $s$ from which we need to select $p(s)$.
From the above relation between $P'$ and $P$ we deduce that $P(s, A_s) = 1$ for
$(\zeta_0^S)_\#\nu$-almost every $s$. In addition, since $A_s$ is the section above $s$ of the Borel set

\[ A := \{(s,x) \in Z_0^S \times X : \zeta_0^S(\pi(x)) = s \text{ and } P'(x, \cdot\,) = P(s, \cdot\,)\}, \]

by the Measurable Selector Theorem 2.2 we can choose a measurable selector $p$ of
$\zeta_0^S \circ \pi$ such that $(s, p(s)) \in A$ for $(\zeta_0^S)_\#\nu$-almost every $s$. This selector now has the
properties claimed. □
Proof of Theorem 4.1 (1) Given the measurable selector $p$ of the above lemma,
define

\[ K_s := \{g \in G_s : \tau(p(s), (s,g)) = \zeta_0^T(p(s))\}. \]

It is clear from the definition of $\tau$ that $\tau(p(s), (s,g)) = \zeta_0^T(p(s))$ if and only if
$\tau(p(s), (s,gg')) = \tau(p(s), (s,g'))$ for every $g' \in G_s$, and hence $K_s$ is a closed
subgroup of $G_s$ for almost every $s$.

Also, we have

\[ A := \{(s,g) \in Z_0^S \ltimes G_\bullet : g \in K_s\} = \{(s,g) \in Z_0^S \ltimes G_\bullet : \tau(p(s), (s,g)) = \zeta_0^T(p(s))\}, \]

and so this is a Borel subset of $Z_0^S \ltimes G_\bullet$. Letting $U$ be the repository, it follows
that for any open $V_1, V_2, \ldots, V_k \subseteq U$ and closed $W \subseteq U$ we have

\[
\{s \in Z_0^S : K_s \cap V_i \neq \emptyset\ \forall i \leq k\ \&\ K_s \cap W = \emptyset\}
 = \bigcap_{i \leq k} \theta\big(A \cap (Z_0^S \times V_i)\big) \Big\backslash\, \theta\big(A \cap (Z_0^S \times W)\big)
\]

(recalling that $\theta$ is the projection onto the first coordinate) and so from the
Measurable Selector Theorem 2.2 this is a universally measurable set. Allowing $V_1$, $V_2$,
\ldots, $V_k$ and $U \setminus W$ to run over all finite strings drawn from some countable
collection of open subsets of $U$ that generates the whole topology, we deduce that the
map $s \mapsto K_s$ is universally measurable, and so after modifying it on a $\nu$-negligible
set if necessary we may assume it is Borel.

Finally, it also follows from the fact that $\tau(p(s), (s,gg')) = \tau(p(s), (s,g'))$ for
every $g' \in G_s$ and $g \in K_s$ that the map $(s,g') \mapsto \tau(p(s), (s,g'))$ virtually factorizes
through the canonical factor

\[ Z_0^S \ltimes G_\bullet \to Z_0^S \ltimes K_\bullet\backslash G_\bullet \]

to leave a map

\[ \alpha : Z_0^S \ltimes K_\bullet\backslash G_\bullet \to Z_0^T, \]

and that this is injective away from some negligible set, since if $\alpha(s_1, K_{s_1}g_1) =
\alpha(s_2, K_{s_2}g_2)$ then $s_1 = \xi(\alpha(s_1, K_{s_1}g_1)) = \xi(\alpha(s_2, K_{s_2}g_2)) = s_2$ and now

\[ \tau(p(s_1), (s_1, g_1)) = \tau(p(s_1), (s_1, (g_1g_2^{-1})g_2)) = \tau(p(s_1), (s_1, g_2)), \]

which implies that $g_1g_2^{-1} \in K_{s_1}$ provided $s_1$ did not lie in the negligible subset of
$Z_0^S$ on which we modified $K_\bullet$ above. By another appeal to Theorem 2.2 the map
$\alpha$ has a Borel virtual inverse, say

\[ \beta : Z_0^T \to Z_0^S \ltimes K_\bullet\backslash G_\bullet. \]

Now $\phi := \beta \circ \zeta_0^T$ is a coordinatization of $Z_0^T$ for which the desired diagram is
commutative.
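(2) From the construction of $\beta$ and another measurable selection there is a
measurable map $u : Y \to K_\bullet\backslash G_\bullet$ such that

\[ \phi(y,g) = \phi\big(\tau_0((y, 1_{G_{\zeta_0^S(y)}}), (\zeta_0^S(y), g))\big) = (\zeta_0^S(y), u(y)g); \]

composing $u$ with a measurable selector $Y \ltimes K_\bullet\backslash G_\bullet \to Y \ltimes G_\bullet$ gives a measurable
map $b : Y \to G_\bullet$ such that $\phi(y,g) = (\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)g)$ almost everywhere.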

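(3) Since $\phi$ is $T$-invariant we have

\begin{align*}
(\zeta_0^S(y), K_{\zeta_0^S(y)}b(S^\gamma y)\rho(\gamma, y)1_{G_{\zeta_0^S(y)}})
 &= (\zeta_0^S(S^\gamma y), K_{\zeta_0^S(S^\gamma y)}b(S^\gamma y)\rho(\gamma, y)1_{G_{\zeta_0^S(S^\gamma y)}}) \\
 &= \phi(S^\gamma y, \rho(\gamma, y)1_{G_{\zeta_0^S(y)}}) \\
 &= \phi(T^\gamma(y, 1_{G_{\zeta_0^S(y)}})) = \phi(y, 1_{G_{\zeta_0^S(y)}}) = (\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)1_{G_{\zeta_0^S(y)}})
\end{align*}

$\nu$-almost surely for any $\gamma \in \Gamma$, from which the required cohomology condition
follows at once.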

(4) Suppose that $K'_\bullet$ and $b'$ have the asserted properties, and let $U$ be the overall
repository for our compact group data. Then the map

\[ Y \ltimes G_\bullet \to \mathrm{Clos}\,U : (y,g) \mapsto K'_{\zeta_0^S(y)}b'(y)g \]

is a measurable map into a standard Borel space, and it is $\mu$-almost surely
$T$-invariant from the property that $b'(S^\gamma y)\rho(\gamma, y)b'(y)^{-1}$ lies almost surely in $K'_{\zeta_0^S(y)}$.
Therefore, because $\phi$ coordinatizes the $T$-invariant factor, we know that there is a
Borel map $f : Z_0^S \ltimes (K_\bullet\backslash G_\bullet) \to \mathrm{Clos}\,U$ such that

\[ K'_{\zeta_0^S(y)}b'(y)g = f(\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)g) \]

for $\mu$-almost every $(y,g)$. In particular, it follows that for $\nu$-almost every $y$, it is the
case that for $m_{G_{\zeta_0^S(y)}}$-almost every $g \in G_{\zeta_0^S(y)}$ and $m_{b(y)^{-1}K_{\zeta_0^S(y)}b(y)}$-almost every
$h \in b(y)^{-1}K_{\zeta_0^S(y)}b(y)$ we have

\[ f(\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)g) = f(\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)hg), \]

and hence

\[ K'_{\zeta_0^S(y)}b'(y)g = K'_{\zeta_0^S(y)}b'(y)hg \quad\Longrightarrow\quad K'_{\zeta_0^S(y)}b'(y) = K'_{\zeta_0^S(y)}b'(y)h. \]

Since $(g,h)$ were chosen arbitrarily from a Haar-conegligible subset of
$G_{\zeta_0^S(y)} \times b(y)^{-1}K_{\zeta_0^S(y)}b(y)$, it follows that for $\nu$-almost every $y$ we have

\[ b(y)b'(y)^{-1} \cdot K'_{\zeta_0^S(y)} \cdot (b(y)b'(y)^{-1})^{-1} \geq K_{\zeta_0^S(y)}. \]

This tells us that the set

\[ \{(s,c) \in Z_0^S \ltimes G_\bullet : c \cdot K'_s \cdot c^{-1} \geq K_s\} \]

(which is Borel for Borel versions of the measurable assignments $K_\bullet$ and $K'_\bullet$)
has nonempty fibre above $(\zeta_0^S)_\#\nu$-almost every $s \in Z_0^S$, and so letting $c$ be a
measurable selector for this set completes the argument.

(5) Finally, observe that for almost every $(s, K_sg')$ that parameterizes a
$T$-ergodic component we have $\phi(y,g) = (s, K_sg')$ if and only if $\zeta_0^S(y) = s$ and
$K_sb(y)g = K_sg'$, hence if and only if $g \in b(y)^{-1}K_sg'$. From this the last
conclusion follows at once. □

Definition 4.3 (Mackey data). We refer to the measurable group data $K_\bullet$ given by
the above theorem as Mackey group data of $\rho$ over $(Y, \nu, S)$, and to the section $b$
as a Mackey section (note that in general the Mackey group data is not unique, but
is so up to $S$-invariant conjugacy, by part (4) of the theorem).

From the above result for extensions by group data we can easily generalize to 
extensions by homogeneous space data. 

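Corollary 4.4. Suppose that $(X, \mu, T) = (Y, \nu, S) \ltimes (G_\bullet/H_\bullet, m_{G_\bullet/H_\bullet}, \rho)$,
$\zeta_0^S : Y \to Z_0^S$ a coordinatization of the base isotropy factor and $P : Z_0^S \stackrel{\mathrm{p}}{\to} Y$ a
version of the disintegration of $\nu$ over $\zeta_0^S$. Then there are subgroup data $K_\bullet \leq G_\bullet$
and a section $b : Y \to G_\bullet$ such that the factor map

\[ \phi : X \to Z_0^S \ltimes (K_\bullet\backslash G_\bullet/H_\bullet) : (y, gH_{\zeta_0^S(y)}) \mapsto (\zeta_0^S(y), K_{\zeta_0^S(y)}b(y)gH_{\zeta_0^S(y)}) \]

is a coordinatization of the isotropy factor $\zeta_0^T : X \to Z_0^T$, and the probability
kernel

\[ (s, K_sg'H_s) \mapsto P(s, \cdot\,) \ltimes m_{b(\bullet)^{-1}K_sg'H_s/H_s} \]

is a version of the disintegration of $\mu$ over $\phi$, where for any subset $S \subseteq G_s$ we
write $S/H_s := \{gH_s : g \in S\}$.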

Proof Let $(X', \mu', T') := (Y, \nu, S) \ltimes (G_\bullet, m_{G_\bullet}, \rho)$ and $\pi : (X', \mu', T') \to (X, \mu, T)$
the covering factor map, and now let $K_\bullet$, $b$ and $\phi'$ be given by applying
Theorem 4.1 to the canonical factor map $\pi' : (X', \mu', T') \to (Y, \nu, S)$. Let $\phi$ be the map
given by the above formula.

Since $\phi$ is manifestly $T$-invariant we need only show that it recoordinatizes the
whole of $Z_0^T$, and not a properly smaller factor. If $f \in L^\infty(\mu)$ is $T$-invariant then
$f \circ \pi$ is $T'$-invariant, and so factorizes through the map $\phi'$ given by Theorem 4.1.
It follows that $f$ virtually factorizes through $\phi$. □

4.2 More general lifted measures on homogeneous space extensions 

Theorem 4.1 describes the components of the ergodic decomposition of $\nu \ltimes m_{G_\bullet}$
under $S \ltimes \rho$, but in fact the same ideas can be used to describe all relatively
$(S \ltimes \rho)$-ergodic lifts of $\nu$. This stronger result, and its corollary for extensions by
homogeneous space data, will be important for both the inverse theory to be developed in
the next section and the study of automorphisms of extensions in the section after
that.

We first set up some simple enabling results concerning the collection of lifts of a 
given probability measure on an extension by homogeneous space data to measures 
on a covering extension by group data. 

Definition 4.5 (Lift topology). Suppose that $S : \Gamma \curvearrowright (Y, \nu)$, $G_\bullet/H_\bullet$ is $S$-invariant
measurable homogeneous space data with repository $U$, $\rho : \Gamma \times Y \to G_\bullet$ is a
cocycle-section and $\mu$ is an $(S \ltimes \rho)$-invariant probability measure on $Y \ltimes G_\bullet/H_\bullet$
that lifts $\nu$, and let $Q$ be the set of all further lifts of $\mu$ to $(S \ltimes \rho)$-invariant
probability measures on $Y \ltimes G_\bullet$. We define the lift topology on $Q$ as the weakest topology
with respect to which the evaluation functionals

\[ \mu' \mapsto \int_{Y \ltimes G_\bullet} 1_A(y) \cdot f(g)\, \mu'(d(y,g)) \]

are continuous for all $A \in \Sigma_Y$ and continuous functions $f : U \to \mathbb{R}$.

The following is now routine.

Lemma 4.6. Under the lift topology Q is a nonempty compact convex set. 

Proof It is easy to witness one member of $Q$ (and so see that it is nonempty):
\[
\mu' := \int_{Y \ltimes G_\bullet/H_\bullet} m_{gH_y}\, \mu(\mathrm{d}(y, gH_y)).
\]
Convexity is obvious, so we need only verify compactness. However, $Q$ is a
closed subset of the larger convex set $Q_0$ containing all lifts of $\nu$ to
$Y \times U$, and this set is easily seen to be a closed subset of the unit
ball of $L^\infty(\nu; \mathcal{M}(U))$ (where $\mathcal{M}(U)$ is the Banach
space of signed measures on $U$) in the weak* topology. Now the Banach-Alaoglu
Theorem tells us that this larger set $Q_0$ is compact and the proof is
complete. □

Remark Of course, it is easy to construct examples of general extensions of
Borel actions $(X, T) \to (Y, S)$ for which a given invariant probability
measure on $Y$ has no invariant extension to $X$ (indeed, with $Y = \{*\}$ any
Borel action on $X$ with no invariant probability measure will do). The
nonemptiness assertion of the above lemma, though simple, is very much a
consequence of the isometric structure of the extensions in question. ◁

Remark An alternative route to topologizing $Q$ (to be found, for example, in
Furstenberg's paper [23] and Glasner's book [29]) is to choose a
coordinatization of $(Y, \nu, S)$ as a homeomorphic action on a compact space
with an invariant Borel probability measure, and then simply introduce the
usual vague topology on our convex set of lifts. It is not hard to see that the
resulting topology is the same; we have chosen the present approach only
because it seems more intrinsic. ◁

We can now approach the main results of this subsection. 

Proposition 4.7. Suppose that $S : \Gamma \curvearrowright (Y, \nu)$, that
$G_\bullet$ are $Z_0^S$-measurable group data and
$\rho : \Gamma \times Y \to G_\bullet$ is a cocycle-section over $S$ and that
$X$ is the space $Y \ltimes G_\bullet$ but equipped with some unknown
$(S \ltimes \rho)$-invariant and relatively ergodic lift $\mu$ of $\nu$. Then
there are subgroup data $K_\bullet \le G_\bullet$ and a section
$b : Y \to G_\bullet$ such that
$\mu = \nu \ltimes m_{b(\bullet)^{-1} K_\bullet}$.

Remark Once again, the case with $S$ ergodic is classical: it can be found as
Theorem 3.26 in Glasner [29]. ◁

Proof Let $T := S \ltimes \rho$ and consider again the extension of the Borel
system $T : \Gamma \curvearrowright X$ given by
\[
\tau_0 : X \times_{Z_0^S} (Z_0^S \ltimes G_\bullet) \to X.
\]
In addition, let $P : Z_0^S \to X$ be a version of the disintegration of the
unknown lift $\mu$ over $\zeta_0^S \circ \pi$. Now let
\[
K'_s := \{g \in G_s : \tau_0(P(s,\,\cdot\,), (s,g)) = P(s,\,\cdot\,)\};
\]
this is a closed subgroup of $G_s$ that is universally measurable in $s$ by
just the same argument as in part (1) of the proof of Theorem 4.1, and is such
that
\[
\tau_0(P(s,\,\cdot\,), (s,gg')) = \tau_0(P(s,\,\cdot\,), (s,g'))
\]
whenever $g' \in G_s$ and $g \in K'_s$. Adjusting $K'_\bullet$ on a negligible
subset of $Y$ so that it is Borel, we still obtain that the composed kernel
$(s,g) \mapsto \tau_0(P(s,\,\cdot\,), (s,g))$ virtually factorizes through the
canonical factor
$Z_0^S \ltimes G_\bullet \to Z_0^S \ltimes K'_\bullet\backslash G_\bullet$ to a
kernel $P' : Z_0^S \ltimes K'_\bullet\backslash G_\bullet \to X$ such that
$P'((s, K'_s),\,\cdot\,) = P(s,\,\cdot\,)$.

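Now define another kernel $P'' : Z_0^S \to X$ by
\[
P''(s, A) := \int_{K'_s\backslash G_s} P'((s, K'_s g), A)\, \mathrm{d}(K'_s g),
\qquad A \in \Sigma_X.
\]
This is an 'averaged out' version of $P'$. It is also clearly $S$-invariant,
satisfies $\pi_\# P''_\# \zeta^S_{0\#}\nu = \nu$, and also satisfies
\[
\tau_0(P''(s,\,\cdot\,), (s,g)) = P''(s,\,\cdot\,) \quad \text{for all } g \in G_s;
\]
hence $P''_\# \zeta^S_{0\#}\nu$ must simply be equal to
$\nu \ltimes m_{G_\bullet}$.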

Now, the measure $P'((s, K'_s g),\,\cdot\,)$ is $T$-ergodic for almost every
$(s,g)$ (since it is a fibrewise right-translate of $P(s,\,\cdot\,)$ by some
fixed element of $G_s$), and so it follows that the integral
\[
\nu \ltimes m_{G_\bullet} = P''_\# \zeta^S_{0\#}\nu
  = \int_{Z_0^S} \int_{K'_s\backslash G_s} P'((s, K'_s g),\,\cdot\,)\,
    \mathrm{d}(K'_s g)\, \zeta^S_{0\#}\nu(\mathrm{d}s)
\]
is a version of the ergodic decomposition of $\nu \ltimes m_{G_\bullet}$. By
Theorem 4.1, $K'_\bullet$ must be a version of the Mackey group data for $\rho$
over $(Y, \nu, S)$, and hence by part (3) of that theorem it follows that there
is a section $b' : Y \to G_\bullet$ such that
$\rho'(\gamma, y) := b'(S^\gamma y) \cdot \rho(\gamma, y) \cdot b'(y)^{-1}$
takes values in $K'_{\zeta_0^S(y)}$ almost surely.

Now we finish the proof simply by applying the fibrewise recoordinatizing
isomorphism $\psi : (y,g) \mapsto (y, b'(y)g)$ from our original system
$(X, \mu, T)$ to the system $(X, \psi_\#\mu, S \ltimes \rho')$. This map $\psi$
must carry the $(S \ltimes \rho)$-ergodic decomposition of
$\nu \ltimes m_{G_\bullet}$ to its $(S \ltimes \rho')$-ergodic decomposition,
and hence each of the measures $P'((s, K'_s g),\,\cdot\,)$ to ergodic measures
supported on the disjoint sets $\{s\} \times K'_s g'$. Since $\rho'$ almost
surely takes values in $K'_s$ and these components must integrate up to
$\nu \ltimes m_{G_\bullet}$, it follows that
$\psi_\# P_\# \zeta^S_{0\#}\nu = \nu \ltimes m_{K'_\bullet g(\bullet)}$ for some
$g : Z_0^S \to G_\bullet$, and now applying $\psi^{-1}$ to this equation and
replacing $K_\bullet := g(\bullet)^{-1} K'_\bullet g(\bullet)$ and
$b := b' \cdot g$ gives the result. □
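
The intertwining property of the fibrewise map $\psi$ used in the last step can be checked by a one-line computation. The following is only a sketch: it assumes the skew-product convention $(S \ltimes \rho)^\gamma(y,g) = (S^\gamma y, \rho(\gamma,y)g)$, which is inferred from the surrounding text rather than quoted from it.

```latex
\begin{align*}
\psi\bigl((S\ltimes\rho)^\gamma(y,g)\bigr)
  &= \psi\bigl(S^\gamma y,\ \rho(\gamma,y)\,g\bigr)
   = \bigl(S^\gamma y,\ b'(S^\gamma y)\,\rho(\gamma,y)\,g\bigr) \\
  &= \bigl(S^\gamma y,\ \underbrace{b'(S^\gamma y)\,\rho(\gamma,y)\,b'(y)^{-1}}_{=\,\rho'(\gamma,y)}
      \cdot\, b'(y)\,g\bigr)
   = (S\ltimes\rho')^\gamma\bigl(\psi(y,g)\bigr).
\end{align*}
```

In particular $\psi$ carries $(S \ltimes \rho)$-invariant measures to $(S \ltimes \rho')$-invariant measures and preserves ergodicity, which is all that the argument requires of it.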

Once again, the result for extensions by group data implies a version for
extensions by homogeneous space data through lifting to a covering group
extension and then descending again, just as for Corollary 4.4. The proof is
essentially the same, and so we omit it here.

Corollary 4.8. Suppose that $S : \Gamma \curvearrowright (Y, \nu)$,
$H_\bullet \le G_\bullet$ are $Z_0^S$-measurable group data and
$\rho : \Gamma \times Y \to G_\bullet$ is a cocycle-section over $S$ and $X$ is
the space $Y \ltimes G_\bullet/H_\bullet$ but equipped with some unknown
$(S \ltimes \rho)$-invariant and relatively ergodic lift $\mu$ of $\nu$. Then
there are subgroup data $K_\bullet \le G_\bullet$ on $Z_0^S$ and a section
$b : Y \to G_\bullet$ such that
$\mu = \nu \ltimes m_{b(\bullet)^{-1} K_\bullet H_\bullet / H_\bullet}$. □

Arguing exactly as in the classical case of an ergodic base system by replacing 
some given group data G, with the Mackey group data K, and recoordinatizing 
(see Corollary 3.27 in Glasner [29]), we obtain the following corollary.

Corollary 4.9. Given a $\Gamma$-system $Y = (Y, \nu, S)$, measurable
$S$-invariant homogeneous space data $G_\bullet/K_\bullet$ over $Y$ and a
cocycle-section $\rho : \Gamma \times Y \to G_\bullet$, and defining
$X := Y \ltimes G_\bullet/K_\bullet$ and $T := S \ltimes \rho$, any
$(S \ltimes \rho)$-relatively ergodic lift $\mu$ of $\nu$ admits a
recoordinatization
\[
(X, \mu, T) \;\cong\; Y \ltimes (G'_\bullet/H'_\bullet,\, m_{G'_\bullet/H'_\bullet},\, \rho')
\]
as extensions of $Y$, so that the implicit covering group extension
$Y \ltimes (G'_\bullet, m_{G'_\bullet}, \rho') \to Y$ is also relatively
ergodic. □

During the development of the inverse theory of the next section we will use the 
preceding results in conjunction with the following elementary lemma. 

Lemma 4.10. Suppose that $\pi : (X, \mu, T) \to (Y, \nu, S)$ is a relatively
ergodic extension of a not-necessarily ergodic system, and that
\[
(X, \mu, T) \xrightarrow{\ \pi_{(n)}\ } (Z_{(n)}, \mu_{(n)}, T_{(n)}) \longrightarrow (Y, \nu, S)
\]
for $n = 1, 2, \ldots$ is a sequence of intermediate extensions that are all
coordinatizable as extensions by homogeneous-space data. Then the resulting
joint extension
\[
\bigl(Z_{(1)} \times Z_{(2)} \times \cdots,\ (\pi_{(1)} \vee \pi_{(2)} \vee \cdots)_\#\mu,\
T|_{\pi_{(1)} \vee \pi_{(2)} \vee \cdots}\bigr) \to (Y, \nu, S)
\]
is also coordinatizable as an extension by homogeneous-space data.

Remark This is a straightforward extension of Lemma 8.4 in Furstenberg [23]. ◁

Proof We know that for each $n \ge 1$ there are some $Z_0^S$-measurable
homogeneous-space data $G_{n,\bullet}/H_{n,\bullet}$, a cocycle-section
$\rho_n : \Gamma \times Y \to G_{n,\bullet}$ and a coordinatizing isomorphism
\[
\alpha_n : (Z_{(n)}, \mu_{(n)}, T_{(n)}) \to
(Y, \nu, S) \ltimes (G_{n,\bullet}/H_{n,\bullet},\, m_{G_{n,\bullet}/H_{n,\bullet}},\, \rho_n).
\]
Each $\alpha_n \circ \pi_{(n)}$ is a factor map of $X$ that gives a
recoordinatization of the factor associated to $\pi_{(n)}$ and takes the form
$(\pi, \theta_n)$ for a suitable map
$\theta_n : X \to G_{n,\bullet}/H_{n,\bullet}$. Now we simply set
$G_\bullet := \prod_{n \ge 1} G_{n,\bullet}$,
$H_\bullet := \prod_{n \ge 1} H_{n,\bullet}$, $\rho := (\rho_n)_{n \ge 1}$ and
$\theta := (\theta_n)_{n \ge 1}$; it is clear that the map $(\pi, \theta)$ now
gives the desired recoordinatization of
$\bigl(Z_{(1)} \times Z_{(2)} \times \cdots,\ (\pi_{(1)} \vee \pi_{(2)} \vee \cdots)_\#\mu,\
T|_{\pi_{(1)} \vee \pi_{(2)} \vee \cdots}\bigr)$
as an extension of $Y$ by homogeneous-space data. □



5 Relative weak non-mixing and isometric extensions of 
non-ergodic systems 

In this section we shall recount the main results of our non-ergodic version of 
the Furstenberg-Zimmer inverse theory. Although it seems that these non-ergodic 
analogs do not formally follow from their ergodic predecessors, their proofs largely 
follow the original arguments of Furstenberg and Zimmer, with a few judicious 
invocations of measurable selectors along the way. For this reason our presentation 
here, as in the preceding section, will be quite terse. The original papers [50],
[49] and [23] remain clear and thorough references for the classical results in the
presence of ergodicity, and we direct the reader to these for many of the original 
ideas. 

The theory developed by Furstenberg and Zimmer considers an ergodic extension
of an ergodic system $Y = (Y, \nu, S)$. Given such an extension
$\pi : X \to Y$, this theory gives an account of the possible failure of
ergodicity of the relatively independent self-joining $X \times_\pi X$ (that
is, of the 'relative weak mixing' of $X$ over $\pi$): it turns out that this
occurs if and only if the extension contains a nontrivial subextension that can
be coordinatized as a homogeneous skew-product. It is this result that we shall
presently extend by dropping the assumption that $S$ be ergodic, and by working
instead with extensions by (possibly variable) homogeneous space data. Note,
however, that we will continue to assume relative ergodicity of $X \to Y$: if
this fails then the arguments that follow derail quite quickly, and the best
account of the structure of the extension that can be given in this case seems
to result from simply considering the relatively invariant subextension first,
and then working with the remaining (necessarily relatively ergodic) extension
over that.

In fact, here as in the ergodic setting just a little extra work will show that
once the failure of relative weak mixing is understood in this way, the same
structures account for the non-ergodicity of other relatively independent
joinings: given two extensions $\pi_i : X_i = (X_i, \mu_i, T_i) \to Y_i$,
$i = 1, 2$, a joining $\nu$ of $Y_1$ and $Y_2$ and a lift $\mu$ of $\nu$ to a
joining of $X_1$ and $X_2$ under which the copies of these two factors are
relatively independent over the copies of $Y_1$ and $Y_2$, this larger joining
$\mu$ can fail to be $(T_1 \times T_2)$-ergodic only if each of the extensions
$X_i \to Y_i$ contains a nontrivial subextension that is coordinatizable as a
homogeneous skew-product, and the homogeneous spaces and cocycles of these
skew-products are suitably related to each other. The result above simply
corresponds to the case $X_1 = X_2$, $Y_1 = Y_2$ and $\nu$ the diagonal
self-joining of the smaller system.

5.1 Generalized eigenfunctions and finite-rank modules 

Key to the reduction from the failure of relative weak mixing to nontrivial exten- 
sions by homogeneous space data are the notions of finite rank modules and iso- 
metric extensions. These definitions are taken almost unchanged from the papers 
of Furstenberg [23] and Zimmer [49].

Definition 5.1 (Modules over factors and their rank). If
$\pi : (X, \mu) \to (Y, \nu)$ is an extension of standard Borel probability
spaces and $\mathfrak{M}$ is a closed subspace of $L^2(\mu)$, then we shall
refer to $\mathfrak{M}$ as a $\pi$-module if
$(h \circ \pi) \cdot f \in \mathfrak{M}$ whenever $f \in \mathfrak{M}$ and
$h \in L^\infty(\nu)$.

If $\int_Y^\oplus \mathfrak{H}_y\, \nu(\mathrm{d}y)$ is the direct integral
decomposition of $L^2(\mu)$ over $\pi$, then subordinate to this we may form
the direct integral decomposition
$\int_Y^\oplus \mathfrak{M}_y\, \nu(\mathrm{d}y)$ of $\mathfrak{M}$ over $\pi$;
and now we shall write that $\mathfrak{M}$ has rank $r$ over $\pi$ if
$r \in \{1, 2, \ldots, \infty\}$ is minimal such that
$\dim \mathfrak{M}_y \le r$ for $\nu$-almost every $y$. If $r < \infty$ we
shall write that $\mathfrak{M}$ has finite $\pi$-rank.

Definition 5.2 (Isometric extension). A system extension $\pi : X \to Y$ is
isometric if $L^2(\mu)$ is generated as a $\pi$-module by its finite rank
$T$-invariant $\pi$-submodules.
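
To illustrate Definitions 5.1 and 5.2, here is a standard example that is not taken from the text and is meant only as a sketch. We assume a circle extension $X = Y \times \mathbb{T}$ with $\mu = \nu \otimes m_{\mathbb{T}}$, factor map $\pi(y,z) = y$ and action $T^\gamma(y,z) = (S^\gamma y, \sigma(\gamma,y)z)$ for some measurable cocycle $\sigma : \Gamma \times Y \to \mathbb{T}$; the symbols $\sigma$, $\chi_n$ and $\mathfrak{M}_n$ are introduced here purely for illustration.

```latex
% Characters of the circle fibre give rank-one invariant modules.
\[
\chi_n(y,z) := z^n, \qquad
\mathfrak{M}_n := \{ (h\circ\pi)\cdot\chi_n : h \in L^2(\nu) \}, \qquad n \in \mathbb{Z}.
\]
% Each \mathfrak{M}_n is closed, since \|(h\circ\pi)\chi_n\|_{L^2(\mu)} = \|h\|_{L^2(\nu)},
% it is a \pi-module of rank one, and it is T-invariant because |\sigma| = 1:
\[
\bigl((h\circ\pi)\cdot\chi_n\bigr)\circ T^\gamma
  = \Bigl(\bigl((h\circ S^\gamma)\cdot\sigma(\gamma,\,\cdot\,)^n\bigr)\circ\pi\Bigr)\cdot\chi_n
  \in \mathfrak{M}_n .
\]
% Fibrewise Fourier expansion in the z-variable shows that these modules span:
\[
L^2(\mu) = \overline{\bigoplus_{n\in\mathbb{Z}} \mathfrak{M}_n}.
\]
```

Under these assumptions $L^2(\mu)$ is generated by finite-rank $T$-invariant $\pi$-submodules, so the circle extension is isometric in the sense of Definition 5.2, whether or not $S$ is ergodic.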

The following first step towards representing finite rank modules will be crucial. 

Lemma 5.3 (Orthonormal basis for a module). Suppose that $\mathfrak{M}$ is a
rank-$r$ $\pi$-module for some $r < \infty$. Then there is a tuple
$\phi_1, \phi_2, \ldots, \phi_r$ of functions in $\mathfrak{M}$, none of them
vanishing everywhere, and a measurable function
$R : Y \to \{1, 2, \ldots, r\}$ such that
$\mathsf{E}_\mu(\phi_i \cdot \overline{\phi_j} \mid \pi) = \delta_{ij} 1_{\{j \le R\}}$
and
\[
(L^2(\nu) \circ \pi) \cdot \phi_1 + \cdots + (L^2(\nu) \circ \pi) \cdot \phi_r
\]
is $L^2$-dense in $\mathfrak{M}$.

Proof Let $R(y) := \dim \mathfrak{M}_y$ and decompose $Y$ as
$\bigcup_{s \le r} \{y : R(y) = s\}$. It is easy to check that $R$ must be
measurable, and the existence of suitable $\phi_i$ now follows just as in the
classical case by considering the cells $\{y : R(y) = s\}$ separately: see
Lemma 9.4 of Glasner [29]. It is clear that none of the $\phi_i$ can vanish
everywhere, else the module $\mathfrak{M}$ would actually have rank at most
$r - 1$, contradicting our assumptions. □

Definition 5.4. We refer to the function $R$ above as the local rank function
of the module $\mathfrak{M}$. ◁



The following lemma is trivial, but it will prove convenient to be able to call on it 
explicitly. 

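Lemma 5.5. If
$(X, \mu) \xrightarrow{\ \pi\ } (Z, \theta) \xrightarrow{\ \pi'\ } (Y, \nu)$ is
a tower of probability-preserving maps and $\mathfrak{M} \le L^2(\mu)$ is a
$(\pi' \circ \pi)$-module of rank $r$, then
$L^\infty(\theta) \cdot \mathfrak{M}$ is a $\pi$-module of rank at most $r$. □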



5.2 The non-ergodic Furstenberg-Zimmer inverse theorem 

We now present our non-ergodic extension of the Furstenberg-Zimmer inverse the- 
orem. This follows naturally from two separate propositions. 

Proposition 5.6 (From relative non-weak-mixing to finite rank modules). If
$\pi : X \to Y$ is relatively ergodic but not relatively weakly mixing, then
$L^2(\mu)$ contains a nontrivial $T$-invariant finite-rank $\pi$-submodule.

Proof This first proposition is proved just as in the ergodic case, so we shall
only sketch its proof, referring the reader to Chapter 9 in Glasner [29] or
Section 7 of Furstenberg [23] for a more careful treatment. Form the relatively
independent self-product $X \times_\pi X$ with its natural coordinate
projections $\pi_1, \pi_2 : X \times_\pi X \to X$. Note that by construction
these maps quotient to give a single factor copy of $Y$ through $\pi$, and so
up to $(\mu \otimes_\pi \mu)$-almost-everywhere equality of functions we have
$L^2(\nu) \circ \pi \circ \pi_1 = L^2(\nu) \circ \pi \circ \pi_2$. The key idea
is to choose a $(T \times T)$-invariant function $H$ that lies in
$L^2(\mu \otimes_\pi \mu) \setminus (L^2(\nu) \circ \pi \circ \pi_1)$, and then
define from it a bounded operator $A$ on $L^2(\mu)$ by
\[
A\psi := \mathsf{E}_{\mu \otimes_\pi \mu}\bigl((\psi \circ \pi_2) \cdot H \,\big|\, \pi_1\bigr).
\]
It is easy to see that this cannot be the identity if
$H \notin L^2(\nu) \circ \pi \circ \pi_1$. Replacing $H(x,x')$ by either
$H(x,x') + H(x',x)$ or $H(x,x') - \mathrm{i}H(x',x)$ if necessary, we may also
arrange that $A$ be self-adjoint. In defining this $A$ we have produced a
'relative Hilbert-Schmidt operator', acting as a Hilbert-Schmidt operator
separately on each fibre $\mathfrak{H}_y$ of the Hilbert space direct integral
decomposition $L^2(\mu) = \int_Y^\oplus \mathfrak{H}_y\, \nu(\mathrm{d}y)$. It
now admits a spectral decomposition into finite-dimensional eigenspaces in each
of these fibres, relative over the factor $\pi$. In principle the list of
corresponding eigenvalues can vary with $y \in Y$, but it is standard (see
Section 9.3 of Glasner [29]) that they do so measurably and are $S$-invariant.
Now we can simply make a measurable selection of one of these non-zero
eigenvalues over each $\zeta_0^S(y)$ and associate to that eigenvalue its
corresponding finite-dimensional eigenspace $\mathfrak{M}_y$, to produce a
nontrivial $\pi$-submodule $\mathfrak{M}$ of $L^2(\mu)$ such that
$\dim \mathfrak{M}_y$ is $S$-invariant and almost surely finite, and now
truncating this by retaining the nontrivial space $\mathfrak{M}_y$ only on the
$S$-invariant set $\{y : \dim \mathfrak{M}_y \le M\}$ for some sufficiently
large $M$ gives a true finite-rank module. Its $T$-invariance follows
immediately from that of $H$ and hence of $A$, completing the proof. □

Our machinery of direct integrals of homogeneous spaces becomes necessary for 
the second stage of the argument. 

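Proposition 5.7 (Coordinatization of isometric extensions). If
$\pi : X \to Y$ is a relatively ergodic extension for which $L^2(\mu)$ is
generated by its $T$-invariant finite-rank $\pi$-submodules, then there are
$Z_0^S$-measurable homogeneous space data $G_\bullet/K_\bullet$ with fibre
repository $U := \prod_{n \ge 1} \mathrm{U}(n)^{\mathbb{N}}$ and a
cocycle-section $\rho : \Gamma \times Y \to G_\bullet$ such that
\[
X \;\cong\; Y \ltimes (G_\bullet/K_\bullet,\, m_{G_\bullet/K_\bullet},\, \rho)
\]
as extensions of $Y$, with $\pi$ corresponding to the canonical factor map onto
$Y$.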

Proof Let $\mathfrak{M}^{(n)}$ for $n \ge 1$ be a sequence of finite-rank
$\pi$-modules generating $L^2(\mu)$; since $L^2(\mu)$ is separable we need only
countably many.

Suppose $\mathfrak{M}^{(n)}$ has rank $r$ and let
$\phi_1, \phi_2, \ldots, \phi_r$ be an orthonormal basis for it as guaranteed
by Lemma 5.3. It is easy to check that $\phi_j \circ T^\gamma$, $j \le r$, also
form an orthonormal basis for $\mathfrak{M}^{(n)}$ for each
$\gamma \in \Gamma$, and so we can find a measurable cocycle
$\Phi : \Gamma \times Y \to \mathrm{U}(R(\bullet))$ with values in the
finite-dimensional unitary group such that
\[
\bigl(\phi_1|_{\pi^{-1}\{S^\gamma y\}} \circ T^\gamma, \ldots,
      \phi_{R(y)}|_{\pi^{-1}\{S^\gamma y\}} \circ T^\gamma\bigr)
  = \Phi(\gamma, y)\bigl(\phi_1|_{\pi^{-1}\{y\}}, \ldots, \phi_{R(y)}|_{\pi^{-1}\{y\}}\bigr)
\]
in $\mathfrak{H}_y \times \cdots \times \mathfrak{H}_y$ for $\nu$-almost every
$y$ (indeed, this equation serves as the definition of $\Phi$ and witnesses its
measurability as a function of $(\gamma, y)$). From this it follows that
$\sum_{j \le r} |\phi_j|^2$ is $T$-invariant, and by relative ergodicity can
therefore be factorized through $\pi$. It must therefore equal $R(\pi(x))$
almost everywhere by the relative orthonormality of the $\phi_j$.

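Letting
\[
\phi : x \mapsto R(\pi(x))^{-1/2}\bigl(\phi_1(x), \phi_2(x), \ldots, \phi_{R(\pi(x))}(x)\bigr),
\]
it follows that $x \mapsto (\pi(x), \phi(x))$ is a map
$X \to Y \ltimes \mathrm{S}^{2R(\bullet)-1}$ (where $\mathrm{S}^{2r-1}$ denotes
the unit sphere in $\mathbb{C}^r$) which intertwines $T$ with $S \ltimes \Phi$.
Letting $\xi : Y \to \mathrm{S}^{2R(\bullet)-1}$ be any measurable selection
(in this case we could take it to be constant on the level-sets of $R$) and
then noting that
$\mathrm{U}(R(\bullet)) \curvearrowright \mathrm{S}^{2R(\bullet)-1} \cong
\mathrm{U}(R(\bullet))/\mathrm{Stab}_{\mathrm{U}(R(\bullet))}(\xi(\bullet))$,
it follows that $(\pi, \phi)$ coordinatizes a subextension of $X \to Y$ as the
homogeneous-space data extension
$S \ltimes \Phi \curvearrowright Y \ltimes
\mathrm{U}(R(\bullet))/\mathrm{Stab}_{\mathrm{U}(R(\bullet))}(\xi(\bullet))$
carrying some invariant measure. Finally, by Corollary 4.9 we know that this
can be adjusted to a genuine extension by homogeneous-space data carrying the
associated direct integral measure.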

Writing this subextension as
$X \xrightarrow{\ \pi^{(n)}\ } Z^{(n)} \to Y$, it is clear that
$L^2(\pi^{(n)}_\#\mu) \circ \pi^{(n)} \supseteq \mathfrak{M}^{(n)}$. Hence the
target system of the factor $\pi^{(1)} \vee \pi^{(2)} \vee \cdots$ contains
every $\mathfrak{M}^{(n)}$ and so must be equivalent to the whole system $X$.
Finally, Lemma 4.10 assures us that this can still be coordinatized as an
extension by homogeneous space data, completing the proof. □

We should also check the converse of the preceding proposition in the non-ergodic 
setting. 

Lemma 5.8. A relatively ergodic extension by compact homogeneous space data
$X := Y \ltimes (G_\bullet/K_\bullet, m_{G_\bullet/K_\bullet}, \rho)$ with
canonical factor map $\pi : X \to Y$ is generated by its finite-rank
$\pi$-submodules.

Proof Let $U$ be the compact fibre repository for the data $G_\bullet$, and let
$\mathfrak{H}_y := L^2(m_{G_y/K_y})$ for $y \in Y$; it is easy to check that
this defines a measurable family of Hilbert spaces. Let
$L : G_y \curvearrowright \mathfrak{H}_y$ be the left regular unitary action.
Now for any continuous function $\psi$ on $U$ the associated operators
\[
\int_{G_y} \psi(g)\, L(g)\, m_{G_y}(\mathrm{d}g)
\]
clearly form a measurable family in $y$ and are each a compact operator on
$\mathfrak{H}_y$. Moreover, their eigenspaces are $L(G_y)$-invariant, and so
different measurable selections of these eigenspaces now combine to form
finite-rank $\pi$-submodules of
$L^2(\mu) = \int_Y^\oplus \mathfrak{H}_y\, \nu(\mathrm{d}y)$ that are
$T$-invariant. Finally choosing a single sequence of continuous mollifiers
$(\psi_n)_{n \ge 1}$ on $U$ (that is, of continuous functions on $U$ such that
$\psi_n \cdot m_U \to \delta_{1_U}$ in the vague topology as $n \to \infty$),
all possible measurable selections of eigenspaces of their associated compact
operators together generate the whole of $L^2(\mu)$, completing the proof. □

Given an arbitrary relatively ergodic extension $\pi : X \to Y$ that is not
relatively weakly mixing, Proposition 5.6 guarantees that its subextension
generated by all finite-rank $\pi$-submodules is nontrivial; and now applying
Proposition 5.7 to this subextension immediately gives our full version of the
inverse theorem. Combined with the 'direct' result of Lemma 5.8 this gives the
following.

Theorem 5.9 (Furstenberg-Zimmer Theorem in the non-ergodic setting). Suppose
that the extension $\pi : X \to Y$ is relatively ergodic. Then there is a
unique maximal subextension $X \to Z \to Y$ that can be coordinatized as an
extension of $Y$ by homogeneous-space data, it equals the subextension
generated by all finite-rank $T$-invariant $\pi$-modules, and $\pi$ fails to be
relatively weakly mixing for $T$ if and only if this subextension contains
$\pi$ strictly. □

Definition 5.10 (Maximal isometric subextension). The subextension $Z \to Y$
given by the preceding theorem is the maximal isometric subextension of
$\pi : X \to Y$.



Remark Let us digress to locate the need for relative ergodicity in the above
arguments. This occurred during the proof of Proposition 5.7 when we argued
that our orthonormal basis $\phi_1, \phi_2, \ldots, \phi_r$ must have
$\sum_{j \le r} |\phi_j(x)|^2$ measurable with respect to $\pi$ in view of its
$T$-invariance, and from this that we could synthesize from this basis a map
$X \to \mathrm{S}^{2R(\bullet)-1}$ which together with $\pi$ would lead to an
explicit coordinatization by homogeneous space data.

It is clear that given an extension that is not relatively ergodic, we can
still derive some structural consequences from the failure of relative weak
mixing as follows. A simple check shows that the argument that converts a
nontrivial invariant function on $X \times_\pi X$ to a finite-rank module over
$\pi$ does not require relative weak mixing, and so we can still sensibly
define the subextension
$X \xrightarrow{\ \xi\ } Z \xrightarrow{\ \alpha\ } Y$ generated by all
finite-rank $\pi$-modules. Next, any function on $X$ that is actually
$T$-invariant clearly defines a rank-1 such module, and so $\xi$ certainly
contains the factor $\zeta_0^T \vee \pi$. Now, in addition, any finite-rank
module $\mathfrak{M}$ over the factor $\pi$ gives a finite-rank module
$L^\infty((\zeta_0^T \vee \pi)_\#\mu) \cdot \mathfrak{M}$ over
$\zeta_0^T \vee \pi$. In light of this and the inverse theory for the
relatively ergodic case, $\xi : X \to Z$ must actually be contained in the
maximal isometric subextension of the joint factor map $\zeta_0^T \vee \pi$.
Letting $W$ be the target system of $\zeta_0^T \vee \pi$, the maximal isometric
subextension of the relatively ergodic extension $\zeta_0^T \vee \pi$ now is
coordinatizable as an extension by compact homogeneous space data, and from the
results of the next section it will follow that the resulting subextension
$Z \to W$ is also coordinatizable by compact homogeneous space data. It may,
however, be properly contained in the maximal isometric subextension of
$\zeta_0^T \vee \pi$, since there may be finite-rank modules over
$\zeta_0^T \vee \pi$ that cannot be obtained as above from finite-rank modules
over $\pi$.

We suspect that more can be said in general about which subextensions of
$\zeta_0^T \vee \pi$ can be obtained from finite-rank modules over $\pi$, but
we will not explore this matter further here. Note that in several previous
works, such as Furstenberg and Katznelson's proof of multidimensional multiple
recurrence in [25] and their later applications of similar ideas to prove other
results in density Ramsey theory in [26] and [27], this very concrete analysis
in terms of extensions by homogeneous space data is avoided altogether. In its
place is used a much softer property of an extension called 'relative
compactness', which is also a consequence of its being generated by finite-rank
submodules and turns out to be enough to enable a proof of the relevant
multiple recurrence results without the more precise information offered by a
coordinatization. We will not explore relations with this idea further here,
but refer the reader to Furstenberg's book [24] for a treatment of such
arguments. ◁

We can now extend the following definition from the ergodic-base case. 

Definition 5.11 (Distal extensions and distal towers). Given a system extension
$\pi : X \to Y$ we note that $\zeta_0^T \vee \pi$ coordinatizes the maximal
factor of $X$ that is relatively invariant over $\pi$, and now write
$\zeta_{1/\pi}^T : X \to Z_1^T(X/\pi)$ to denote any choice of factor map that
coordinatizes the maximal isometric subextension of $\zeta_0^T \vee \pi$. In
general we define recursively an increasing transfinite sequence of factors
$\zeta_{\eta/\pi}^T : X \to Z_\eta^T(X/\pi)$ indexed by all ordinals $\eta$ by
letting $\zeta_{\eta+1/\pi}^T : X \to Z_{\eta+1}^T(X/\pi)$ denote any choice of
factor map that coordinatizes the maximal isometric subextension of
$\zeta_{\eta/\pi}^T$ for each $\eta$, and letting
$\zeta_{\eta/\pi}^T := \bigvee_{\xi < \eta} \zeta_{\xi/\pi}^T$ when $\eta$ is a
limit ordinal. Note that for any fixed system $X$ there must be some ordinal
$< \omega_1$ at which this tower stabilizes. If $Y$ is a trivial system
$(\{*\}, \delta_*, \mathrm{id}_{\{*\}})$ we simplify this notation to
$\zeta_\eta^T : X \to Z_\eta^T$.

The extension $\pi : X \to Y$ is distal if
$\bigvee_\eta \zeta_{\eta/\pi}^T \simeq \mathrm{id}_X$. We refer to
$Z_\eta^T(X/\pi) \to Y$ as the maximal $\eta$-step distal subextension of
$\pi$, and to the totally ordered collection of all of these subextensions as
the distal tower of $\pi$.

As suggested at the beginning of this section, our inverse theorem can quite
easily be extended to account for relative non-ergodicity of more general
relatively independent self-joinings. The standard proof of this in the case of
ergodic base (Theorem 9.21 in Glasner [29]), which reduces this situation to
that of the self-joining treated above, does not rely on the ergodicity of the
base system and so carries over essentially unchanged. We only state the result
here.

Theorem 5.12. Suppose that $\pi_i : X_i \to Y_i$ are relatively ergodic
extensions for $i = 1, 2, \ldots, n$ and that $\nu$ is a joining of
$Y_1, Y_2, \ldots, Y_n$ forming the system
$Y = (Y, \nu, S) := (Y_1 \times Y_2 \times \cdots \times Y_n,\, \nu,\,
S_1 \times S_2 \times \cdots \times S_n)$. Suppose further that
$X = (X, \mu, T)$ is similarly a joining of $X_1, X_2, \ldots, X_n$ that
extends $\nu$ through the coordinatewise factor map $\pi : X \to Y$ assembled
from the $\pi_i$, and such that under $\mu$ the coordinate projections
$\alpha_i : X \to X_i$ are relatively independent over the tuple of further
factors $\pi_i \circ \alpha_i$. Then the intermediate factor map
\[
\zeta_{1/\pi_1}^{T_1} \vee \zeta_{1/\pi_2}^{T_2} \vee \cdots \vee \zeta_{1/\pi_n}^{T_n} : X \to Z
\]
whose target $Z$ is a joining of the systems $Z_1^{T_i}(X_i/\pi_i)$ is
equivalent to $\zeta_{1/\pi}^T : X \to Z_1^T(X/\pi)$. In particular, it
contains the relatively invariant extension $Z_0^T \vee Y \to Y$, which may be
nontrivial. □

We will call on this version of the inverse theorem when we come to our
applications in Section 7.

6 Factors and automorphisms of isometric extensions 

In this section we examine the possible forms of factors and automorphisms
of extensions by homogeneous space data, using as our main tool the non-ergodic
Mackey Theory of Section 4.

6.1 Some more notation 

We first need to set up some additional notation that will help us to describe alge- 
braic transformations between the fibres Gy/Ky of our extensions. 

Our first important convention is that given a Polish group $U$ and two compact
subgroups $G, G' \le U$, we identify a continuous isomorphism
$\Phi : G \to G'$ with its graph $\{(g, \Phi(g)) : g \in G\} \le U \times U$.
In view of the results of Subsection 2.4 this sets up a bijective
correspondence between continuous isomorphisms and compact subgroups
$M \in \mathrm{Lat}(U \times U)$ with the property that $M$ has first and
second projections equal to $G$ and $G'$ respectively and
$M \cap (U \times \{1_U\}) = M \cap (\{1_U\} \times U) = \{1_{U \times U}\}$.
We write $\mathrm{Isom}(G, G')$ for this collection of isomorphisms, and
interpret composition $\Phi \circ \Phi'$ for $\Phi \in \mathrm{Isom}(G, G')$
and $\Phi' \in \mathrm{Isom}(G', G'')$ in the obvious way. Note that after
fixing a complete separable metric on $U$, the resulting strong topology on
$\mathrm{Isom}(G, G')$ is in general strictly stronger than the Vietoris
topology inherited from $\mathrm{Lat}(U \times U)$, but the resulting Borel
structures are the same. Exactly similarly we can also interpret the collection
$\mathrm{Hom}(G, G')$ of homomorphisms $G \to G'$,
$\mathrm{Aut}\, G = \mathrm{Isom}(G, G)$ and
$\mathrm{End}\, G = \mathrm{Hom}(G, G)$ as collections of subgroups of
$U \times U$.
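
The algebra behind this correspondence can be unpacked as follows. This is only a sketch of the elementary part of the argument; $p_1, p_2$ denote the two coordinate projections of $U \times U$, a notation introduced here and not used in the text, and the results of Subsection 2.4 themselves are not reproduced.

```latex
Let $M \le U \times U$ be a compact subgroup with $p_1(M) = G$ and $p_2(M) = G'$.
\begin{itemize}
  \item If $M \cap (\{1_U\} \times U) = \{1_{U \times U}\}$ and $(g,h), (g,h') \in M$, then
        $(g,h)^{-1}(g,h') = (1_U, h^{-1}h') \in M$ forces $h = h'$.
        Hence $M$ is the graph of a map $\Phi : G \to G'$, and $\Phi$ is a
        homomorphism because $M$ is a subgroup.
  \item If $M \cap (U \times \{1_U\}) = \{1_{U \times U}\}$, then $\Phi(g) = 1_U$
        only for $g = 1_U$, so $\Phi$ is injective.
  \item The condition $p_2(M) = G'$ says exactly that $\Phi$ is surjective.
  \item Since the graph $M$ is closed and $G'$ is compact Hausdorff, $\Phi$ is continuous.
\end{itemize}
```

Conversely, the graph of a continuous isomorphism between compact groups is a compact subgroup satisfying all of these conditions, which is why the two descriptions can be used interchangeably.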

This identification made, the necessary definition follows very naturally. 

Definition 6.1 (Homomorphism sections). Suppose that $(Y, \nu)$ is a standard
Borel probability space and that $G_{i,\bullet}$, $i = 1, 2$, are two different
measurable families of compact group data over $Y$ with compact metrizable
repositories $U$ and $V$ respectively. Then a homomorphism section associated
to this data is a map $y \mapsto \Phi_y : Y \to \mathrm{Lat}(U \times V)$ that
is Borel for the Vietoris measurable structure on $\mathrm{Lat}(U \times V)$
and is such that $\Phi_y \in \mathrm{Hom}(G_{1,y}, G_{2,y})$ for $\nu$-almost
every $y \in Y$. We will sometimes denote this situation by
$\Phi_\bullet : Y \to \mathrm{Hom}(G_{1,\bullet}, G_{2,\bullet})$.

We extend this definition in the obvious way to epimorphism sections, isomorphism 
sections and automorphism sections. 

The benefit of formulating our notion of 'isomorphism section' as above will be- 
come clear shortly, when we use the Mackey theory in a self-joining of a given 
extension to produce a measurable family of compact subgroups of U x U that 
we can then immediately re-interpret as an isomorphism cocycle in this sense. In 
doing this we will also benefit from having the following notation. 

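Definition 6.2 (Extensions by epimorphism sections). If
$\beta : (Y_1, \nu_1) \to (Y_2, \nu_2)$ is a probability-preserving map,
$G_{i,\bullet}/H_{i,\bullet}$ is measurable compact homogeneous space data on
$Y_i$ for $i = 1, 2$ and
$\Phi_\bullet : Y_1 \to \mathrm{Hom}(G_{1,\bullet}, G_{2,\beta(\bullet)})$ is a
measurable epimorphism section such that
$\Phi_y(H_{1,y}) \subseteq H_{2,\beta(y)}$ almost surely then we write
$\beta \ltimes \Phi_\bullet|^{H_{1,\bullet}}_{H_{2,\bullet}}$ for the
probability-preserving map
\[
Y_1 \ltimes G_{1,\bullet}/H_{1,\bullet} \to Y_2 \ltimes G_{2,\bullet}/H_{2,\bullet} :
(y, gH_{1,y}) \mapsto (\beta(y), \Phi_y(g)H_{2,\beta(y)}).
\]
More generally, if $\rho : Y_1 \to G_{2,\beta(\bullet)}$ is a section then we
will sometimes write $L_{\rho(y)}$ (resp. $R_{\rho(y)}$) for the translation of
$G_{2,\beta(\bullet)}$ by left-rotation by $\rho(y)$ (resp. right-rotation by
$\rho(y)$), and define
\[
\beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)|^{H_{1,\bullet}}_{H_{2,\bullet}} :
(y, gH_{1,y}) \mapsto (\beta(y), \rho(y) \cdot \Phi_y(g)H_{2,\beta(y)}).
\]
In case $H_{i,\bullet} = \{1_{G_{i,\bullet}}\}$ we simplify this to
\[
\beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet) : (y, g) \mapsto (\beta(y), \rho(y) \cdot \Phi_y(g)).
\]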

Note that if (Yi, ui) = {Y2, 1^2) then P x = /? ix p and /? k = p t< in 
the original notation of Section 3. We introduce the above class of fibrewise
transformations, together with their new notation, to help us describe more
explicitly certain maps and joinings between isometric extensions, in the sense
of the following definition, and to help differentiate them from those
isometric extensions themselves, among which these new maps serve as morphisms.

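Definition 6.3 (Fibrewise automorphism and affine recoordinatizations). Suppose
that $Y = (Y, \nu, S)$ is a $\Gamma$-system, $G_\bullet/K_\bullet$ are
$S$-invariant compact homogeneous space data over $Y$ and
$\sigma : \Gamma \times Y \to G_\bullet$ is an ergodic cocycle-section for $S$,
and that $\Phi_\bullet : Y \to \mathrm{Aut}(G_\bullet)$ is an $S$-invariant
automorphism section and $\rho : Y \to G_\bullet$ a section. Then the map
$R := \mathrm{id}_Y \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)|^{K_\bullet}_{K'_\bullet}$
defines a recoordinatization
\[
Y \ltimes (G_\bullet/K_\bullet,\, m_{G_\bullet/K_\bullet},\, \sigma) \;\cong\;
Y \ltimes (G_\bullet/K'_\bullet,\, m_{G_\bullet/K'_\bullet},\, \sigma')
\]
as extensions of $Y$, with $K'_\bullet := \Phi_\bullet(K_\bullet)$ and
\[
\sigma'(\gamma, y) := \rho(S^\gamma y) \cdot \Phi_y(\sigma(\gamma, y)) \cdot \rho(y)^{-1}.
\]
We refer to such a recoordinatization as a fibrewise affine recoordinatization.
If $\rho \equiv 1_{G_\bullet}$ then it is a fibrewise automorphism
recoordinatization.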
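
The formula for $\sigma'$ can be checked by a direct computation. The following is a sketch under two assumptions inferred from the surrounding definitions rather than quoted: that the skew-product acts by $(S \ltimes \sigma)^\gamma(y, gK_y) = (S^\gamma y, \sigma(\gamma,y)gK_y)$, and that $R(y, gK_y) = (y, \rho(y)\Phi_y(g)K'_y)$.

```latex
% S-invariance of the data gives K_{S^\gamma y} = K_y, K'_{S^\gamma y} = K'_y
% and \Phi_{S^\gamma y} = \Phi_y for almost every y.
\begin{align*}
R\bigl((S\ltimes\sigma)^\gamma(y, gK_y)\bigr)
  &= \bigl(S^\gamma y,\ \rho(S^\gamma y)\,\Phi_{S^\gamma y}(\sigma(\gamma,y)\,g)\,K'_{S^\gamma y}\bigr) \\
  &= \bigl(S^\gamma y,\ \rho(S^\gamma y)\,\Phi_y(\sigma(\gamma,y))\,\Phi_y(g)\,K'_y\bigr) \\
  &= \bigl(S^\gamma y,\ \underbrace{\rho(S^\gamma y)\,\Phi_y(\sigma(\gamma,y))\,\rho(y)^{-1}}_{=\,\sigma'(\gamma,y)}
       \cdot\, \rho(y)\,\Phi_y(g)\,K'_y\bigr) \\
  &= (S\ltimes\sigma')^\gamma\bigl(R(y, gK_y)\bigr).
\end{align*}
```

The second equality is the only place where the $S$-invariance of $\Phi_\bullet$ is used; without it an extra factor $\Phi_{S^\gamma y} \circ \Phi_y^{-1}$ would survive in the fibre.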

Remarks 1. The condition that $\Phi_\bullet$ be $S$-invariant is needed in
order that $R$ intertwine $S \ltimes \sigma$ with another cocycle extension;
without this condition the new transformation will in general still involve
also a nontrivial automorphism-valued cocycle.

2. If $G_\bullet$ and $\rho$ are as above then we have
$\beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet) =
\beta \ltimes (\mathrm{Co}_{\rho(\bullet)} \circ \Phi_\bullet \circ R_{\rho(\bullet)})$,
where $\mathrm{Co}_{\rho(y)} \in \mathrm{Aut}(G_y)$ is the inner automorphism
of conjugation by $\rho(y)$. It follows that we can always choose between
expressing a fibrewise affine recoordinatization by using fibrewise
left-rotations or fibrewise right-rotations (or even some mixture of the two!).
In general we will prefer to write fibrewise affine recoordinatizations with
rotation part acting on the left, since this is the form in which the
$S$-invariance of the automorphism part $\Phi_\bullet$ is directly visible, but
occasionally this ability to conjugate between the two forms will give some
useful flexibility in how we write a fibrewise affine recoordinatization. ◁

Note that transformations of the form
$\beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)$ on $U \times V$ for
compact Abelian groups $U$ and $V$, $\beta$ a rotation of $U$,
$\rho : U \to V$ and $\Phi : U \to \mathrm{Aut}\, V$ are already objects of
study in ergodic theory: examples arise from explicitly coordinatizing a flow
on a two-step solvmanifold (see, for example, Auslander, Green and Hahn [2] or
Starkov [42] for background), with rather special conditions on the cocycles
$\rho$ and $\Phi$ that result (although working explicitly with such
coordinatizations would probably be a cumbersome way to handle solvflows). More
generally, it is possible that an enlarged theory of extensions by compact
homogeneous space data could be constructed to treat actions lifted by
affine-valued cocycles comprising both fibrewise rotations and automorphisms.
Certainly, $\mathbb{Z}^d$-actions by affine
maps on compact groups have recently begun to receive greater attention from er- 
godic theorists (see Schmidt IMl, in particular), and it would be interesting to know 
how far the results that are now known for such systems could be 'relativized' . In 
this paper, however, fibrewise automorphisms and affine maps will play a strictly 
auxiliary role. 



6.2 The structure theorems 

The main results of this section amount to structure theorems for factors and auto- 
morphisms of isometric extensions, generalizing the classical result (and also the 
proof) of Mentzen [37] in the ergodic case (see also earlier work of Newton [38],
and compare with the classical argument of Veech in B31 for his condition for an 
extension to be coordinatizable as a compact group skew-product). 

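Theorem 6.4 (Relative Factor Structure Theorem). Suppose that $\mathbf{Y}_i = (Y_i, \nu_i, S_i)$
for $i = 1,2$ are $\Gamma$-systems and $\beta : \mathbf{Y}_1 \to \mathbf{Y}_2$ is a factor map, that $G_{i,\bullet}/H_{i,\bullet}$ are
$S_i$-invariant core-free homogeneous space data on $Y_i$ and that $\sigma_i : \Gamma \times Y_i \to G_{i,\bullet}$
are ergodic cocycle-sections for the action $S_i$, and let
$\mathbf{X}_i = (X_i, \mu_i, T_i) := \mathbf{Y}_i \ltimes (G_{i,\bullet}/H_{i,\bullet}, m_{G_{i,\bullet}/H_{i,\bullet}}, \sigma_i)$
with canonical factor $\xi_i : \mathbf{X}_i \to \mathbf{Y}_i$. Suppose further
that $\beta$ admits extension to a factor map $\alpha : \mathbf{X}_1 \to \mathbf{X}_2$:

$$\begin{array}{ccc} \mathbf{X}_1 & \xrightarrow{\ \alpha\ } & \mathbf{X}_2 \\ {\scriptstyle \xi_1}\downarrow & & \downarrow{\scriptstyle \xi_2} \\ \mathbf{Y}_1 & \xrightarrow{\ \beta\ } & \mathbf{Y}_2 \end{array}$$

Then there are an $S_1$-invariant measurable family of epimorphisms $\Phi_\bullet : G_{1,\bullet} \to G_{2,\beta(\bullet)}$
such that $\Phi_\bullet(H_{1,\bullet}) \subseteq H_{2,\beta(\bullet)}$ almost surely and a section $\rho : Y_1 \to G_{2,\beta(\bullet)}$
such that $\alpha = \beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)|^{H_{2,\beta(\bullet)}}_{H_{1,\bullet}}$, and then

$$\sigma_2(\gamma, \beta(y)) = \rho(S_1^\gamma y) \cdot \Phi_y(\sigma_1(\gamma, y)) \cdot \rho(y)^{-1}$$

for $\nu_1$-almost all $y$ for all $\gamma \in \Gamma$.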

Remark When the base system is ergodic this tells us that any extension of $\beta$ to a factor map
of $\mathbf{X}_1$ must take the form of fibrewise application of a fixed group automorphism
and then left-multiplication by a cocycle: this is the result of Mentzen [37]. $\lhd$

Proof We will deduce this by considering the joining of $\mathbf{X}_1$ and $\mathbf{X}_2$ defined by
the graph of $\alpha$, proceeding in two steps.

Step 1 Suppose first that $H_{i,\bullet} = \{1_{G_{i,\bullet}}\}$ for $i = 1,2$. Setting $\lambda := (\mathrm{id}_{X_1} \vee \alpha)_\# \mu_1$,
this is a $(T_1 \times T_2)$-invariant probability measure on

$$X_1 \times_{\{\beta \circ \xi_1 = \xi_2\}} X_2$$

which is an extension of $\mathbf{Y}_1$ via the natural factor map. Let us denote by $\pi_i$, $i = 1,2$,
the two coordinate projections $X_1 \times_{\{\beta \circ \xi_1 = \xi_2\}} X_2 \to X_i$, and, slightly abusively,
also the coordinate projections $G_{1,\bullet} \times G_{2,\beta(\bullet)} \to G_{1,\bullet}, G_{2,\beta(\bullet)}$. Since this is a
graph joining the first coordinate projection almost surely determines the second,
and so the joined system that results is actually isomorphic to $\mathbf{X}_1$; it follows that $\lambda$
is $(T_1 \times T_2)$-relatively ergodic over $\mathbf{Y}_1$.

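Applying Proposition 4.7 we obtain some $S_1$-invariant Mackey group data $M_\bullet \le
G_{1,\bullet} \times G_{2,\beta(\bullet)}$ on $Y_1$ and a section $b : Y_1 \to G_{1,\bullet} \times G_{2,\beta(\bullet)}$ such that
$\lambda = \nu_1 \ltimes m_{b(\bullet)^{-1} M_\bullet}$. The measure $\lambda$ must project onto $\mu_i$ under $\pi_i$, and so

and hence $\pi_1(M_\bullet) = G_{1,\bullet}$ almost surely, and similarly $\pi_2(M_\bullet) = G_{2,\beta(\bullet)}$ almost
surely.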

On the other hand, if $L_{1,\bullet}$ and $L_{2,\bullet}$ are the first and second slices of $M_\bullet$, then we
see that the coordinate factors $\pi_1$ and $\pi_2$ are actually relatively independent over
the further canonical factors

$$(X_1, \mu_1, T_1) \to (Y_1, \nu_1, S_1) \ltimes (G_{1,\bullet}/L_{1,\bullet}, m_{G_{1,\bullet}/L_{1,\bullet}}, \sigma_1)$$

and

$$(Y_1, \nu_1, S_1) \times_{\{\beta = \xi_2\}} (X_2, \mu_2, T_2) \to (Y_1, \nu_1, S_1) \ltimes (G_{2,\beta(\bullet)}/L_{2,\bullet}, m_{G_{2,\beta(\bullet)}/L_{2,\bullet}}, \sigma_2 \circ \beta).$$

Of course, under a graphical joining of a factor map such as $\lambda$ the second coordinate
projection is almost surely determined by the first, so we must have $L_{2,\bullet} =
\{1_{G_{2,\beta(\bullet)}}\}$ almost surely. Combined with the property of having full projections,
this shows that $M_\bullet$ is almost surely the graph of a measurably-varying
epimorphism $\Phi_\bullet : G_{1,\bullet} \to G_{2,\beta(\bullet)}$.

Also, since $M_\bullet$ has full one-dimensional projections we can multiply $b$ by some
$M_\bullet$-valued cocycle if necessary to assume that $b(\bullet)^{-1} = (1_{G_{1,\bullet}}, \rho(\bullet))$. We can now
simply read off that when $\alpha(y, g) = (\beta(y), g')$, then almost surely

$$(g, g') = (m, \rho(y)\Phi_y(m))$$

for some $m \in G_{1,y}$. Hence we must have $m = g$, and so $g' = \rho(y)\Phi_y(g)$: that
is, $\alpha = \beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)$, as required. Finally, with this expression in hand it
is immediate to check that the commutative diagram relating $\alpha$, $\beta$, $S$ and $T$ is
equivalent to the requirement that

$$\rho(S_1^\gamma y) \cdot \Phi_y(\sigma_1(\gamma, y)) = \sigma_2(\gamma, \beta(y)) \cdot \rho(y)$$

for $\nu_1$-almost every $y$ for all $\gamma \in \Gamma$, which re-arranges into the equation stated.

Step 2 Now consider the case of general core-free $H_{i,\bullet}$ and let $\lambda$ be the graphical
self-joining given previously, so as in Step 1 the system
$(X_1 \times_{\{\beta \circ \xi_1 = \xi_2\}} X_2, \lambda, T_1 \times T_2)$ has a natural factor isomorphic to $\mathbf{Y}_1$
and is isomorphic to $\mathbf{X}_1$ through the first coordinate projection, which also virtually
determines the second coordinate projection.

Now in addition let $\tilde{\mathbf{X}}_i \to \mathbf{X}_i$ be the implied covering group-data extensions,
and let $\tilde\lambda$ be any relatively ergodic lift of $\lambda$ to an invariant joining of $\tilde{\mathbf{X}}_1$ and $\tilde{\mathbf{X}}_2$.
Arguing as in Step 1 now gives Mackey group data $M_\bullet \le G_{1,\bullet} \times G_{2,\beta(\bullet)}$ and a
section $b : Y_1 \to G_{1,\bullet} \times G_{2,\beta(\bullet)}$ such that

$$\tilde\lambda = \nu_1 \ltimes m_{b(\bullet)^{-1} M_\bullet}.$$

Since the cocycles $\sigma_i$ are ergodic, it follows as before that $M_\bullet$ has full one-dimensional
projections. The condition that the first coordinate almost surely determine the second
under $\lambda$ becomes more subtle. Firstly, it requires that the second slice $L_{2,\bullet}$ of
$M_\bullet$ satisfy $L_{2,\bullet} \cdot H_{2,\beta(\bullet)} = H_{2,\beta(\bullet)}$; but on the other hand Lemma 2.8 tells us that
$L_{2,\bullet} \unlhd G_{2,\beta(\bullet)}$, and by the core-free assumption $H_{2,\beta(\bullet)}$ does not contain any
nontrivial normal subgroup, so in fact we must still have $L_{2,\bullet} = \{1_{G_{2,\beta(\bullet)}}\}$. It follows
that $M_\bullet$ still defines the graph of an epimorphism $\Phi_\bullet : G_{1,\bullet} \to G_{2,\beta(\bullet)}$. Secondly,
this same condition on the determination of the second coordinate requires that $\Phi_\bullet$
have a well-defined quotient between the spaces $G_{1,\bullet}/H_{1,\bullet}$ and $G_{2,\beta(\bullet)}/H_{2,\beta(\bullet)}$,
and hence that $\Phi_\bullet(H_{1,\bullet}) \subseteq H_{2,\beta(\bullet)}$ almost surely.

This leads as before to the expression $\beta \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)|^{H_{2,\beta(\bullet)}}_{H_{1,\bullet}}$ for $\alpha$, and the
cocycle equation also follows as before from the condition that $\lambda$ is $(T_1 \times T_2)$-invariant
(equivalent to the intertwining property in the previous case). □
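The rearrangement at the end of Step 1 can be spelled out as follows. This is only a sketch: it assumes trivial $H_{i,\bullet}$, writes the group-data extensions with the left-multiplication convention $T_i^\gamma(y,g) = (S_i^\gamma y, \sigma_i(\gamma,y)\cdot g)$ (an assumed convention, not quoted from the text), and uses the $S_1$-invariance of $\Phi_\bullet$ to identify $\Phi_{S_1^\gamma y}$ with $\Phi_y$.

```latex
\begin{align*}
\alpha\bigl(T_1^\gamma(y,g)\bigr)
  &= \bigl(\beta(S_1^\gamma y),\ \rho(S_1^\gamma y)\cdot\Phi_y(\sigma_1(\gamma,y))\cdot\Phi_y(g)\bigr),\\
T_2^\gamma\bigl(\alpha(y,g)\bigr)
  &= \bigl(S_2^\gamma\beta(y),\ \sigma_2(\gamma,\beta(y))\cdot\rho(y)\cdot\Phi_y(g)\bigr).
\end{align*}
% beta intertwines S_1 with S_2, so the first coordinates agree; cancelling
% Phi_y(g) on the right of the second coordinates gives
\[
\rho(S_1^\gamma y)\cdot\Phi_y(\sigma_1(\gamma,y)) = \sigma_2(\gamma,\beta(y))\cdot\rho(y)
\quad\Longleftrightarrow\quad
\sigma_2(\gamma,\beta(y)) = \rho(S_1^\gamma y)\cdot\Phi_y(\sigma_1(\gamma,y))\cdot\rho(y)^{-1}.
\]
```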

Specializing the above now gives a structure theorem for groups of automorphisms 
of an extension. 

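Theorem 6.5 (Relative Automorphism Structure Theorem). Suppose that $\mathbf{Y} =
(Y, \nu, S)$ is a $\Gamma$-system, that $G_\bullet/H_\bullet$ are $S$-invariant core-free homogeneous space
data on $Y$ and that $\sigma : \Gamma \times Y \to G_\bullet$ is an ergodic cocycle-section for the action
$S$, and let $\mathbf{X} = (X, \mu, T) := \mathbf{Y} \ltimes (G_\bullet/H_\bullet, m_{G_\bullet/H_\bullet}, \sigma)$. Suppose further that
$\Lambda$ is a discrete group and $R : \Lambda \curvearrowright (X, \mu)$ is another action that commutes with
$T$ and respects the canonical factor $\pi : \mathbf{X} \to \mathbf{Y}$ (so it defines an action of $\Lambda$
by automorphisms of the extension $\pi$). Then for each $h \in \Lambda$ there are an
$S$-invariant measurable family of isomorphisms $\Phi_{h,\bullet} : G_\bullet \to G_{R^h|_\pi(\bullet)}$ such that
$\Phi_{h,\bullet}(H_\bullet) = H_{R^h|_\pi(\bullet)}$ almost surely and a section $\rho_h : Y \to G_{R^h|_\pi(\bullet)}$ such that

$$R^h = R^h|_\pi \ltimes (L_{\rho_h(\bullet)} \circ \Phi_{h,\bullet})|^{H_{R^h|_\pi(\bullet)}}_{H_\bullet}$$

for each $h \in \Lambda$, and then

• we have

$$\sigma(\gamma, R^h|_\pi(y)) = \rho_h(S^\gamma y) \cdot \Phi_{h,y}(\sigma(\gamma, y)) \cdot \rho_h(y)^{-1}$$

for $\nu$-almost all $y$ for all $\gamma \in \Gamma$ and $h \in \Lambda$, and

• we have

and

for $\nu$-almost all $y$ for all $h_1, h_2 \in \Lambda$.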

Proof Consider first $R^h$ for some fixed $h \in \Lambda$. Treating $R^h$ as a factor map of $\mathbf{X}$
and applying Theorem 6.4 gives immediately the representation of $R^h$ as

$$R^h|_\pi \ltimes (L_{\rho_h(\bullet)} \circ \Phi_{h,\bullet})|^{H_{R^h|_\pi(\bullet)}}_{H_\bullet}$$

for some section $\rho_h$ and measurable family of continuous epimorphisms $\Phi_{h,\bullet}$; and
now the condition that $R^h$ actually be equivalent to $\mathrm{id}_X$ (that is, it is not a proper
factor map) gives that $\Phi_{h,\bullet}$ is an isomorphism and $\Phi_{h,\bullet}(H_\bullet) = H_{R^h|_\pi(\bullet)}$ almost
surely.

This establishes the existence of $\rho_h$ and $\Phi_{h,\bullet}$, and also the first of the two additional
conclusions above. To deduce the second we need only compare the resulting
coordinatizations of each side of the equation $R^{h_1 h_2} = R^{h_1} \circ R^{h_2}$ defining $R$ as a
$\Lambda$-action: substituting from above this becomes

and so we must have

for all $g \in G_y$ for $\nu$-almost every $y \in Y$. Since $H_{R^{h_1 h_2}|_\pi(y)}$ is core-free in
$G_{R^{h_1 h_2}|_\pi(y)}$ almost surely, the validity of this equation for all $g \in G_y$ implies the
two parts of the second additional conclusion above, completing the proof. □

Remark It should be possible to enhance the above theorem further by allowing
an arbitrary locally compact second countable group $\Lambda$ and imposing suitable
continuity assumptions on the assignments $h \mapsto \rho_h$ of measurable sections and
$h \mapsto \Phi_{h,\bullet}$ of measurable families of isomorphisms. The additional arguments
required seem to be more fiddly than enlightening, however, and so we leave the details
to the interested reader. $\lhd$

Although Theorem 6.5 shows that the $S$-ergodic fibre systems above the points
$\zeta_0^S(y) \in Z_0^S$ and $\zeta_0^S(R^h|_\pi y) \in Z_0^S$ are isomorphic for all $h \in \Lambda$ for almost every
$y \in Y$, it need not follow that these fibre systems are almost all isomorphic to
a single model system. The following simple example has long been a part of
ergodic-theoretic folklore.

Example Let $(Y, \nu) := (\mathbb{T}^2, m_{\mathbb{T}^2})$, and form the direct integral space $X :=
Y \ltimes \mathbb{T}^2$ with constant fibre (so this is really just the direct product $Y \times \mathbb{T}^2$) and
measure $\mu := m_{\mathbb{T}^2} \otimes m_{\mathbb{T}^2}$. Define $T : \mathbb{Z} \curvearrowright X$ by $T(y, z) := (y, y + z)$, so overall
$(X, \mu)$ is the direct integral of the individual Kronecker systems $(\mathbb{T}^2, m_{\mathbb{T}^2}, R_y)$,
writing $R_y$ for the rotation by $y \in \mathbb{T}^2$.

In addition, suppose that $S \curvearrowright \mathbb{T}^2$ is any ergodic toral automorphism. Then it
is easy to check that $S \times S$ commutes with $T$; in particular, it carries fibres of
the obvious factor map $X \to Y$ onto fibres, and so acts as an automorphism of
the fibre system $(\mathbb{T}^2, m_{\mathbb{T}^2}, R_y)$ onto $(\mathbb{T}^2, m_{\mathbb{T}^2}, R_{S(y)})$. However, the fibre systems
$(\mathbb{T}^2, m_{\mathbb{T}^2}, R_y)$ are not almost all isomorphic for different $y$: the map

$$y \mapsto (\mathbb{T}^2, m_{\mathbb{T}^2}, R_y) / \sim_{\mathrm{Isomorphism}}$$

is not almost surely constant, even though it is invariant under the ergodic
transformation $S$. This is possible because the isomorphism equivalence relation on
the space of all Kronecker systems (suitably interpreted as pairs comprising a
monothetic compact metrizable subgroup of a suitable fixed repository and a
distinguished element for the rotation) is non-smooth. $\lhd$
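The commutation claim in this example can be checked mechanically on rational points of the torus. The sketch below is illustrative only: the matrix [[2, 1], [1, 1]] is an assumed choice of ergodic (hyperbolic) toral automorphism, and the function names are hypothetical. Exact rational arithmetic is used so that reduction mod 1 introduces no rounding error.

```python
from fractions import Fraction

def mod1(x):
    """Reduce a rational number to its representative in [0, 1)."""
    return x % 1

def S(v):
    """Toral automorphism induced by the integer matrix [[2, 1], [1, 1]] (an assumed example)."""
    a, b = v
    return (mod1(2 * a + b), mod1(a + b))

def T(y, z):
    """The skew map T(y, z) = (y, y + z) on T^2 x T^2."""
    return (y, (mod1(y[0] + z[0]), mod1(y[1] + z[1])))

def S_times_S(y, z):
    """The product map S x S acting on base and fibre coordinates."""
    return (S(y), S(z))

def commutes_at(y, z):
    """Check (S x S) o T == T o (S x S) at the point (y, z)."""
    return S_times_S(*T(y, z)) == T(*S_times_S(y, z))

y = (Fraction(1, 3), Fraction(2, 7))
z = (Fraction(5, 11), Fraction(3, 4))
print(commutes_at(y, z))
```

The check succeeds because $S$ is additive on the torus, so $S(y + z) = S(y) + S(z)$; this is exactly the statement that $S$ carries the rotation $R_y$ on one fibre to the rotation $R_{S(y)}$ on another.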

Question 6.6. Can an example be found for which the group fibres $G_y$ themselves
are not almost all continuously isomorphic above each $R|_\pi$-ergodic component of
$\nu$?

This may relate to the work of Conze and Raugy [16] on the behaviour of measur-
able families of (not-necessarily compact) groups related by measurable cocycles, 
but we have not been able to answer the above as a direct corollary of their work. 

The following question may also be related to the above: 

Question 6.7. Can an example be found in which for no coordinatization of the
extension is it possible that each $\Phi_y$ can be extended from $G_y$ to an automorphism
of the whole repository group $U$? $\lhd$

The following corollaries concerning the extendability of automorphisms will also 
prove useful later, and may be of some independent interest. 

Corollary 6.8 (Condition for lifting an automorphism to a group-data extension).
An action $R$ of $\Lambda$ by automorphisms of $\mathbf{Y}$ can be lifted to a $\Lambda$-action by
automorphisms of an ergodic group-data extension $\mathbf{Y} \ltimes (G_\bullet, m_{G_\bullet}, \sigma)$ if and only if for
every $h \in \Lambda$ the cocycle

$$\Gamma \times Y \to G_\bullet \times G_{R^h(\bullet)} : (\gamma, y) \mapsto (\sigma(\gamma, y), \sigma(\gamma, R^h y))$$

has relativized Mackey group data over $\mathbf{Y}$ that is the graph of an isomorphism
almost everywhere, and in this case any such extended action is of the form
$h \mapsto R^h \ltimes (L_{\rho_h(\bullet)} \circ \Phi_{h,\bullet})$ for some families of sections $\rho_h : Y \to G_\bullet$ and $S$-invariant
cocycles $\Phi_{h,\bullet} : Y \to \mathrm{Isom}(G_\bullet, G_{R^h(\bullet)})$ and is unique up to composition with
an arbitrary $S$-invariant inner automorphism cocycle. □






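Corollary 6.9 (Automorphisms can always be lifted to core-free ergodic covering
group extensions). Suppose that $\mathbf{Y}$ is a $\Gamma$-system, $G_\bullet/H_\bullet$ are $S$-invariant core-free
homogeneous space data and $\sigma : \Gamma \times Y \to G_\bullet$ is an ergodic cocycle-section. Set
$\mathbf{X} := \mathbf{Y} \ltimes (G_\bullet/H_\bullet, m_{G_\bullet/H_\bullet}, \sigma)$ and $\tilde{\mathbf{X}} := \mathbf{Y} \ltimes (G_\bullet, m_{G_\bullet}, \sigma)$. Then any action of
a discrete group by automorphisms of the canonical extension $\mathbf{X} \to \mathbf{Y}$ lifts to an
action by automorphisms of the tower $\tilde{\mathbf{X}} \to \mathbf{X} \to \mathbf{Y}$.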

Proof Let $\pi : \mathbf{X} \to \mathbf{Y}$ and $\tilde\pi : \tilde{\mathbf{X}} \to \mathbf{Y}$ be the canonical factor maps and suppose
that $R$ is an automorphism of the extension $\pi : \mathbf{X} \to \mathbf{Y}$. Theorem 6.5 allows us
to write $R$ explicitly as $R|_\pi \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)|^{H_{R|_\pi(\bullet)}}_{H_\bullet}$ for some $\rho : Y \to G_{R|_\pi(\bullet)}$
and $S$-invariant $\Phi : Y \to \mathrm{Isom}(G_\bullet, G_{R|_\pi(\bullet)})$ such that $\Phi_y(H_y) = H_{R|_\pi(y)}$ almost
surely; and, having done this, we have that $\mathrm{graph}(\Phi_\bullet)$ is the Mackey group data of
the cocycle-section $(\gamma, y) \mapsto (\sigma(\gamma, y), \sigma(\gamma, R|_\pi y))$, so that

$$\sigma(\gamma, R|_\pi(y)) = \rho(S^\gamma y) \cdot \Phi_y(\sigma(\gamma, y)) \cdot \rho(y)^{-1}$$

almost surely. This equation immediately tells us that we can lift $R$ to the
transformation $\tilde R := R|_\pi \ltimes (L_{\rho(\bullet)} \circ \Phi_\bullet)$ on $\tilde X$, and that this still commutes with $\tilde T = S \ltimes \sigma$.
Given a whole $\Lambda$-action of automorphisms $R^h$, applying this argument to each
$h \in \Lambda$ individually and considering the consistency equations promised by
Theorem 6.5 shows that the lifted maps still define a $\Lambda$-action, and hence completes the
proof. □



7 Applications 

In this section we offer two closely-related applications of the theory developed 
above. 

We first study the possible joint distribution of the isotropy factors $\zeta_0^{T_i}$ corresponding
to three commuting transformations $T_1$, $T_2$ and $T_3$. This will require some
quite careful analysis in terms of Mackey group data, cocycles, and representations
given by the Relative Automorphism Structure Theorem. We will then show
that this analysis can also be brought to bear on a detailed description of character- 
istic factors of the double nonconventional ergodic averages associated to a pair of 
commuting transformations (see, for example, [4] and the references listed there). 

Throughout this section we specialize to the setting of $\Gamma := \mathbb{Z}^d$, and will write $e_1$,
$e_2$, ..., $e_d$ for its standard basis.

7.1 Application to joint distributions of isotropy factors



For a generic $\mathbb{Z}^d$-action on a fixed atomless $(X, \mu)$ the isotropy factors
$\zeta_0^{T\upharpoonright\Lambda} : \mathbf{X} \to \mathbf{Z}_0^{T\upharpoonright\Lambda}$ corresponding to subgroups $\Lambda \le \mathbb{Z}^d$ are all trivial (indeed, it is a classical
result that a generic such action is totally weakly mixing). However, if they are not
all trivial then they generate a sublattice of the lattice of all factors of $(X, \mu, T)$
that can exhibit some quite rich structure.

Letting $T_i := T^{e_i}$ we will here consider only the further sublattice generated by
the isotropy factors $\zeta_0^{T_{i_1}, T_{i_2}, \ldots, T_{i_r}} := \zeta_0^{T\upharpoonright(\mathbb{Z}e_{i_1} + \cdots + \mathbb{Z}e_{i_r})}$ corresponding to the possible choices of
subset $I := \{i_1, i_2, \ldots, i_r\} \subseteq [d]$, where $[d] := \{1, 2, \ldots, d\}$.

Clearly in general the action of each Tj for j € [d]\I on the sets of S J can still be 
quite arbitrary, and so we cannot hope to say anything about the structure of each 
isotropy factor as a system in its own right. Instead we will focus on their joint 
distribution within the original system. 

Example Let $(X, \mu, T_1, T_2)$ be the $\mathbb{Z}^2$-system $(\mathbb{T}^2, \mathrm{Haar}, R_{(\alpha,0)}, R_{(0,\alpha)})$, where
$R_g$ denotes the rotation of the compact Abelian group $G$ by an element $g \in G$
and we choose $\alpha \in \mathbb{T}$ irrational. In this case we have natural coordinatizations

and similarly, since $T_1 T_2 = R_{(\alpha,\alpha)}$,

It follows that in this example any two of $\zeta_0^{T_1}$, $\zeta_0^{T_2}$ and $\zeta_0^{T_1 T_2}$ are independent, but
also that any two of them generate the whole system (and so overall independence
fails). $\lhd$
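For concreteness, one natural way to write the three isotropy factors of this example is as coordinate maps $\mathbb{T}^2 \to \mathbb{T}$. The specific maps below are an assumption made for illustration (any composition with an automorphism of the target would serve equally well), not formulas quoted from the text.

```latex
\[
\zeta_0^{T_1} : (t_1, t_2) \mapsto t_2, \qquad
\zeta_0^{T_2} : (t_1, t_2) \mapsto t_1, \qquad
\zeta_0^{T_1 T_2} : (t_1, t_2) \mapsto t_1 - t_2 .
\]
% Justification: for irrational alpha, a function invariant under
% R_{(alpha,0)} depends only on t_2, one invariant under R_{(0,alpha)}
% depends only on t_1, and one invariant under R_{(alpha,alpha)} depends
% only on t_1 - t_2.
```

Under Haar measure any two of $t_1$, $t_2$ and $t_1 - t_2$ are independent and uniformly distributed on $\mathbb{T}$, while any two of them determine the point $(t_1, t_2)$; this is the combination of pairwise independence and failure of joint independence described in the example.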

In this section we will employ the general machinery of non-ergodic isometric ex- 
tensions and the non-ergodic Furstenberg-Zimmer and Mackey theories to describe 
this joint distribution in the case d = 3. It will turn out that these factors are al- 
ways relatively independent outside certain special 'obstruction' factors, which are 
in turn only a little more general than the above example. 

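Theorem 7.1. Suppose that $T_i : \mathbb{Z} \curvearrowright (X, \mu)$, $i = 1, 2, 3$, are three commuting
actions. Then

(1) The triple of factors $\zeta_0^{T_1,T_2}$, $\zeta_0^{T_1,T_3}$, $\zeta_0^{T_2,T_3}$ is relatively independent over $\zeta_0^{T_1,T_2,T_3}$;

(2) The triple of factors $\zeta_0^{T_1}$, $\zeta_0^{T_2}$, $\zeta_0^{T_3}$ is relatively independent over the further triple
of factors

$$\zeta_0^{T_1} \wedge (\zeta_0^{T_2} \vee \zeta_0^{T_3}), \quad \zeta_0^{T_2} \wedge (\zeta_0^{T_3} \vee \zeta_0^{T_1}), \quad \zeta_0^{T_3} \wedge (\zeta_0^{T_1} \vee \zeta_0^{T_2}).$$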

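Theorem 7.2. We have

and the extension of systems

can be coordinatized as the group extension

$$(\zeta_0^{T_1,T_3} \vee \zeta_0^{T_2,T_3})(\mathbf{X}) \ltimes \bigl(G_{3,\bullet}, m_{G_{3,\bullet}}, (\tau_{3,1} \circ \zeta_0^{T_2,T_3}), (\tau_{3,2} \circ \zeta_0^{T_1,T_3})^{\mathrm{op}}, 1\bigr) \xrightarrow{\ \mathrm{canonical}\ } (\zeta_0^{T_1,T_3} \vee \zeta_0^{T_2,T_3})(\mathbf{X})$$

for some $T|_{\zeta_0^{T_1,T_3} \vee \zeta_0^{T_2,T_3}}$-invariant compact group data $G_{3,\bullet}$ and cocycle-sections
$\tau_{3,1} : Z_0^{T_2,T_3} \to G_{3,\bullet}$ and $\tau_{3,2} : Z_0^{T_1,T_3} \to G_{3,\bullet}$, and similarly for the extension
of $(\zeta_0^{T_i,T_j} \vee \zeta_0^{T_i,T_k})(\mathbf{X})$ by $(\zeta_0^{T_i} \wedge (\zeta_0^{T_j} \vee \zeta_0^{T_k}))(\mathbf{X})$
for any other permutation $i, j, k$ of the indices 1, 2, 3 (in
general with different group data $G_{i,\bullet}$ and cocycle-sections $\tau_{ij}$).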

These two results together amount to Theorem 1.1 of the Introduction. Heuristically,
they assert that the joint distribution of the factor-triple $(\zeta_0^{T_i})_{i=1}^3$ fails to be
relatively independent over the natural candidate factor-triple $(\zeta_0^{T_i,T_j} \vee \zeta_0^{T_i,T_k})_{i=1}^3$
only up to single isometric extensions, and give fairly explicit coordinatizations of
those extensions.

It seems likely that these results could be extended with only routine modifications
to treat a triple of commuting actions $T_i : \Gamma_i \curvearrowright (X, \mu)$ of other locally compact
second countable groups $\Gamma_i$ (the key feature being that the different actions
commute). We have restricted to a triple of $\mathbb{Z}$-actions for notational simplicity.

On the other hand, although we naturally expect Theorem 1.1 to be a special case
of a result for larger numbers of commuting transformations (or actions), the anal- 
ysis of the corresponding isotropy factors based on the Furstenberg-Zimmer and 
Mackey theories becomes quickly much more complicated, and we shall not pur- 
sue this generalization any further at present. 
Observe also that while Theorem 7.2 describes the structure of each system
$(\zeta_0^{T_i} \wedge (\zeta_0^{T_j} \vee \zeta_0^{T_k}))(\mathbf{X})$ as an extension of $(\zeta_0^{T_i,T_j} \vee \zeta_0^{T_i,T_k})(\mathbf{X})$ (which, by Theorem 7.1,
is itself just a relatively independent joining of $\mathbf{Z}_0^{T_i,T_j}$ and $\mathbf{Z}_0^{T_i,T_k}$ over $\mathbf{Z}_0^{T}$), it does
not describe the joint distribution of the factor maps $\zeta_0^{T_i} \wedge (\zeta_0^{T_j} \vee \zeta_0^{T_k})$. This would
require a further analysis, using the relative independence of the isotropy factors
$\mathbf{Z}_0^{T_i,T_j}$ over $\mathbf{Z}_0^{T}$ to understand first the joint distribution of the $(\zeta_0^{T_i,T_j} \vee \zeta_0^{T_i,T_k})(\mathbf{X})$
and then working upwards, and would proceed using very similar ideas to those
below but with relatively smaller returns; we omit the details.

Our basic approach rests on an appeal to the Furstenberg-Zimmer inverse theory 
to reduce the problem to the study of certain isometric extensions, followed by a 
detailed analysis of the possible structure of an associated Mackey group to obtain 
finer information about these extensions. 

This strategy is already well-established in the literature from studies of other
questions working under more restrictive ergodicity assumptions. Indeed,
Furstenberg's original paper [23] developing an ergodic-theoretic approach to Szemerédi's
Theorem, for which much of the abovementioned machinery was originally
developed, uses similar ideas to analyze the structure of a certain self-joining of a
given ergodic $\mathbb{Z}$-system en route to the proof of multiple recurrence. That paper
has since led to a considerably more detailed study of the 'nonconventional
ergodic averages' that appear in this connexion, which we will revisit in the next
section [13, 14, 15, 46, 18, 32, 48].

In addition, Rudolph has given in [39] an analysis of a different question rather
more closely related to the study of isotropy factors: he obtains a description of the
possible eigenfunctions of the product system $S \times T \curvearrowright (Y \times X, \nu \otimes \mu)$ built from
ergodic transformations $S \curvearrowright (Y, \nu)$ and $T \curvearrowright (X, \mu)$, effectively using for the
earlier stages of his work a special case of the analysis to be given below applied
to this latter product system.

We depart from these previous works in our use of the non-ergodic versions of the
basic machinery. In this more general setting we will find that the resulting
structures are considerably more complex, even though the description of the extensions
ultimately obtained in Theorem 7.2 involves only data $G_{i,\bullet}$ that are invariant for the
whole action $T$. In particular, many of these older works have ultimately reduced
their subjects to the study of factors that lie in a very special class of systems, the
'pronilsystems' (see, in particular, Rudolph's work [39] and the papers of Host
and Kra [32] and Ziegler [48] on nonconventional averages). Already in the cases
considered in this section we find that we must look beyond that class.

7.2 Reduction to compositions of isotropy factors

We first prove the (rather simpler) Theorem 7.1 by effecting a quite general
reduction of the problem to the study of certain composed isotropy factors.

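Lemma 7.3. Suppose that $T : \Gamma \curvearrowright (X, \mu)$ is a probability-preserving action of a
locally compact second countable amenable group $\Gamma$ and that $\pi : (X, \mu, T) \to
(Y, \nu, S)$ is a factor. Then $\zeta_0^T$ and $\pi$ are relatively independent over
$\zeta_0^S \circ \pi = \pi|_{\zeta_0^T} \circ \zeta_0^T$.

Proof Let $(I_N)_{N \ge 1}$ be a left-Følner sequence in $\Gamma$. If $A \in \pi^{-1}(\Sigma_Y)$ and $B \in
\Sigma_X$ is $T$-invariant, then

$$\begin{aligned}
\mu(A \cap B) &= \lim_{N \to \infty} \int_X \Bigl( \frac{1}{m_\Gamma(I_N)} \int_{I_N} 1_{T^\gamma(A)} \, m_\Gamma(\mathrm{d}\gamma) \Bigr) \cdot 1_B \, \mathrm{d}\mu \\
&= \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^S \circ \pi) \cdot 1_B \, \mathrm{d}\mu = \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^S \circ \pi) \cdot \mathsf{E}_\mu(1_B \mid \zeta_0^S \circ \pi) \, \mathrm{d}\mu,
\end{aligned}$$

where the middle equality follows from the mean ergodic theorem. □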

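Proof of Theorem 7.1 (1) If $A \in \Sigma_X$ is $T_1$- and $T_2$-invariant, $B \in \Sigma_X$ is $T_1$-
and $T_3$-invariant and $C \in \Sigma_X$ is $T_2$- and $T_3$-invariant then averaging first under $T_3$
gives

$$\int_X 1_A \cdot 1_B \cdot 1_C \, \mathrm{d}\mu = \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^{T_3}) \cdot 1_B \cdot 1_C \, \mathrm{d}\mu = \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^{T_1,T_2,T_3}) \cdot 1_B \cdot 1_C \, \mathrm{d}\mu,$$

and now averaging under $T_2$ gives

$$\begin{aligned}
\int_X \mathsf{E}_\mu(1_A \mid \zeta_0^{T_1,T_2,T_3}) \cdot 1_B \cdot 1_C \, \mathrm{d}\mu
&= \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^{T_1,T_2,T_3}) \cdot \mathsf{E}_\mu(1_B \mid \zeta_0^{T_2}) \cdot 1_C \, \mathrm{d}\mu \\
&= \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^{T_1,T_2,T_3}) \cdot \mathsf{E}_\mu(1_B \mid \zeta_0^{T_1,T_2,T_3}) \cdot 1_C \, \mathrm{d}\mu \\
&= \int_X \mathsf{E}_\mu(1_A \mid \zeta_0^{T_1,T_2,T_3}) \cdot \mathsf{E}_\mu(1_B \mid \zeta_0^{T_1,T_2,T_3}) \cdot \mathsf{E}_\mu(1_C \mid \zeta_0^{T_1,T_2,T_3}) \, \mathrm{d}\mu;
\end{aligned}$$

concatenating these equalities gives the result.

(2) This follows similarly. For this proof let $\psi_i := \zeta_0^{T_i} \wedge (\zeta_0^{T_j} \vee \zeta_0^{T_k})$, where
$\{i, j, k\} = \{1, 2, 3\}$. If $A \in \Sigma_X$ is
$T_1$-invariant, $B \in \Sigma_X$ is $T_2$-invariant and $C \in \Sigma_X$ is $T_3$-invariant then Lemma 7.3
applied to the action $T_i$ and the factors $\zeta_0^{T_i}$ and $\pi := \zeta_0^{T_j} \vee \zeta_0^{T_k}$ gives that these are
relatively independent over $\psi_i$, and hence that

$$\begin{aligned}
\int_X 1_A \cdot 1_B \cdot 1_C \, \mathrm{d}\mu &= \int_X \mathsf{E}_\mu(1_A \mid \psi_1) \cdot 1_B \cdot 1_C \, \mathrm{d}\mu = \int_X \mathsf{E}_\mu(1_A \mid \psi_1) \cdot \mathsf{E}_\mu(1_B \mid \psi_2) \cdot 1_C \, \mathrm{d}\mu \\
&= \int_X \mathsf{E}_\mu(1_A \mid \psi_1) \cdot \mathsf{E}_\mu(1_B \mid \psi_2) \cdot \mathsf{E}_\mu(1_C \mid \psi_3) \, \mathrm{d}\mu,
\end{aligned}$$

as required. □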

7.3 Some isometric extensions and their associated Mackey data 

To prove Theorem 7.2 (and so complete the proof of Theorem 1.1) we need to
understand the structure of the composite factors $\zeta_0^{T_i} \wedge (\zeta_0^{T_j} \vee \zeta_0^{T_k})$ as extensions
of $\zeta_0^{T_i,T_j} \vee \zeta_0^{T_i,T_k}$; most of our work will go into this. As in the statement of the
theorem we will treat the case $(i, j, k) = (3, 1, 2)$, the others being analogous. We
will first obtain some isometricity for the extensions

$$(\zeta_0^{T_3,T_1} \vee \zeta_0^{T_3,T_2})|_{\zeta_0^{T_3} \wedge (\zeta_0^{T_1} \vee \zeta_0^{T_2})} : (\zeta_0^{T_3} \wedge (\zeta_0^{T_1} \vee \zeta_0^{T_2}))(\mathbf{X}) \to (\zeta_0^{T_3,T_1} \vee \zeta_0^{T_3,T_2})(\mathbf{X}),$$

and will then see a gradual extraction of finer and finer properties of these isometric
extensions from an analysis of the associated Mackey data, with an occasional
recoordinatization of the extensions where necessary.

The various isotropy factors stand related as in the following commutative diagram 
(where some of the obvious maps have not been named): 




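It will prove helpful to introduce some more notation. For $i = 1, 2$ let $\alpha_i :=
\zeta_0^{T_i,T_3} \vee \zeta_0^{T_1,T_2}$ and $\mathbf{W}_i = (W_i, (\alpha_i)_\#\mu, T|_{\alpha_i})$ be its target system.

Lemma 7.4. We have

$$\zeta_0^{T_3} \wedge (\zeta_0^{T_1} \vee \zeta_0^{T_2}) \precsim (\zeta_{1/\alpha_1}^{T_3} \wedge \zeta_0^{T_1}) \vee (\zeta_{1/\alpha_2}^{T_3} \wedge \zeta_0^{T_2}).$$

Proof By Lemma 7.3 the system $(\zeta_0^{T_1} \vee \zeta_0^{T_2})(\mathbf{X})$ is a relatively independent joining
of $\mathbf{Z}_0^{T_1}$ and $\mathbf{Z}_0^{T_2}$ over their further factors $\zeta_0^{T_1,T_2}|_{\zeta_0^{T_1}}$ and $\zeta_0^{T_1,T_2}|_{\zeta_0^{T_2}}$. On the
other hand the $T_3|_{\zeta_0^{T_1} \vee \zeta_0^{T_2}}$-invariant functions on $(\zeta_0^{T_1} \vee \zeta_0^{T_2})(\mathbf{X})$ are all virtually
measurable with respect to the maximal subextension of

that is isometric for the restricted action of $T_3$, and so Theorem 5.12 implies that
this in turn is contained in $(\zeta_{1/\alpha_1}^{T_3} \wedge \zeta_0^{T_1}) \vee (\zeta_{1/\alpha_2}^{T_3} \wedge \zeta_0^{T_2})$, as required. □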

At this point we will introduce some new notation for the basic systems and factor 
maps under study. In addition to lightening the presentation, this will make our 
main technical results simultaneously relevant to this and the next section and so 
minimize the duplication of effort. 

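We have defined $\alpha_i := \zeta_0^{T_i,T_3} \vee \zeta_0^{T_1,T_2}$ with target $\mathbf{W}_i$ above. We define also $\zeta_i :=
\zeta_{1/\alpha_i}^{T_3} \wedge \zeta_0^{T_i}$ and let $\mathbf{Z}_i$ be its target system, and we let $\mathbf{Z}$ be the target of $\zeta := \zeta_1 \vee \zeta_2$ (a
joining of $\mathbf{Z}_1$ and $\mathbf{Z}_2$) and $\mathbf{W}$ be the target of $\alpha := \alpha_1 \vee \alpha_2$ (a joining of $\mathbf{W}_1$ and
$\mathbf{W}_2$). As usual the choice of these target systems is arbitrary up to isomorphism,
but in this case it is natural (and notationally convenient) to pick $\mathbf{W}$ to be

$$(W_1 \times W_2, (\alpha_1 \vee \alpha_2)_\#\mu, T|_{\alpha_1} \times T|_{\alpha_2}),$$

since we will often want to discuss separately the two coordinates of a point $(w_1, w_2) \in
W$. These factors are now arranged as shown: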
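[Commutative diagram of factors: $\mathbf{X} \xrightarrow{\zeta} \mathbf{Z}$; $\mathbf{Z} \to \mathbf{Z}_1, \mathbf{Z}_2$; $\mathbf{Z} \xrightarrow{\alpha|_\zeta} \mathbf{W}$; $\mathbf{Z}_i \xrightarrow{\alpha_i|_{\zeta_i}} \mathbf{W}_i$; $\mathbf{W} \to \mathbf{W}_1, \mathbf{W}_2$; and $\mathbf{W}_i \to \mathbf{Z}_0^{T_1,T_2}$ via $\zeta_0^{T_1,T_2}|_{\alpha_i}$.]

where $\mathbf{W}$ and $\mathbf{Z}$ are actually generated by all of their exhibited factors, and the
factors on left- and right-hand sides are relatively independent over their factor
maps to $\mathbf{Z}_0^{T_1,T_2}$.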

In this picture the transformations $T_i = T^{e_i}$ have the following properties:

• $T_i$ restricts to the identity on $\mathbf{Z}_i$ and the factors beneath it, while acting
relatively ergodically on the extension $\zeta_0^{T_1,T_2}|_{\zeta_{3-i}} : \mathbf{Z}_{3-i} \to \mathbf{Z}_0^{T_1,T_2}$, for $i = 1, 2$;

• the extensions $\zeta_0^{T_1,T_2}|_{\alpha_i} : \mathbf{W}_i \to \mathbf{Z}_0^{T_1,T_2}$ are relatively invariant for the
restriction of $T_3$, and the extensions $\alpha_i|_{\zeta_i} : \mathbf{Z}_i \to \mathbf{W}_i$ are relatively ergodic
and isometric for the restriction of $T_3$.

Our goal is to identify the $T_3$-invariant factor of $\mathbf{Z}$ (which we know is also the
overall $T_3$-invariant factor by the above lemma).

In these terms we can now state our main technical result. 

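Proposition 7.5. In the situation described above, there are intermediate factors

$$\mathbf{Z}_i \to \mathbf{Y}_i \xrightarrow{\ \alpha_i|_{\xi_i}\ } \mathbf{W}_i$$

factorizing $\alpha_i|_{\zeta_i}$ such that there are $T$-invariant compact group data $G_\bullet$ and
cocycle-sections

$\sigma : Z_0^{T_1,T_2} \to G_\bullet$ that is $T_3|_{\zeta_0^{T_1,T_2}}$-relatively ergodic,

$\tau_1 : Z_0^{T_2,T_3} \to G_\bullet$ that is $T_1|_{\zeta_0^{T_2,T_3}}$-relatively ergodic and

$\tau_2 : Z_0^{T_1,T_3} \to G_\bullet$ that is $T_2|_{\zeta_0^{T_1,T_3}}$-relatively ergodic

so that we can coordinatize

$$\mathbf{Y}_1 \cong \mathbf{W}_1 \ltimes \bigl(G_\bullet, m_{G_\bullet}, 1, (\tau_2 \circ \zeta_0^{T_1,T_3}|_{\alpha_1})^{\mathrm{op}}, \sigma \circ \zeta_0^{T_1,T_2}|_{\alpha_1}\bigr)$$

and

and such that the $T_3$-invariant factor $\zeta_0^{T_3} \wedge (\zeta_0^{T_1} \vee \zeta_0^{T_2})$ is contained in $\xi_1 \vee \xi_2$.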

Proof of Theorem 7.2 from Proposition 7.5 This now follows simply by
unpacking the new notation. Let $\xi := \xi_1 \vee \xi_2$ with target $\mathbf{Y}$ (a joining of $\mathbf{Y}_1$ and $\mathbf{Y}_2$).
We know that $\xi_1$ and $\xi_2$ (as factors of $\zeta_0^{T_1}$ and $\zeta_0^{T_2}$) are relatively independent over
their further factors $\zeta_0^{T_1,T_2}|_{\xi_i}$, $i = 1, 2$, and hence certainly over the intermediate
factors $\alpha_1$ and $\alpha_2$, and so the coordinatizations of the extensions $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$
by group data given by Proposition 7.5 combine to give a coordinatization of $\mathbf{Y}$
by the group data $G_\bullet^2$ and the combined cocycles. We now observe that the
restriction of $T_3$ to this group data extension is described by the diagonal
cocycle-section $(\sigma, \sigma)$ corresponding to the $T_3|_{\zeta_0^{T_1,T_2}}$-ergodic cocycle-section $\sigma$, and so we
can simply deduce that the Mackey group data can be taken to be the diagonal
subgroup $M_\bullet = \{(g, g) : g \in G_\bullet\}$, and now the associated Mackey section is
trivial by symmetry. This leads to the coordinatization of the $T_3$-invariant factor
$\zeta_0^{T_3} \wedge (\zeta_0^{T_1} \vee \zeta_0^{T_2}) = \zeta_0^{T_3} \wedge \zeta = \zeta_0^{T_3} \wedge \xi$ as given by the location of its target in the
following commutative diagram

[Commutative diagram: the group-data extension $\mathbf{W} \ltimes (G_\bullet^2, m_{G_\bullet^2}, \ldots, (\sigma, \sigma))$ maps canonically onto $(\zeta_0^{T_3} \wedge \alpha)(\mathbf{X}) \ltimes (M_\bullet \backslash G_\bullet^2, m_{M_\bullet \backslash G_\bullet^2}, \ldots)$, which in turn maps canonically onto $(\zeta_0^{T_3} \wedge \alpha)(\mathbf{X})$.]

(where we have suppressed the need to lift $T_i$ through the isotropy factor maps). Now simply
observing that the quotient $M_\bullet \backslash G_\bullet^2$ is canonically bijective with $G_\bullet$ under the map
$M_\bullet \cdot (g_1, g_2) \mapsto g_1^{-1} \cdot g_2$ and applying this bijection fibrewise, the restricted action
of $T_3$ on this quotient is of course trivial and the restricted actions of $T_1$ and $T_2$ turn into
the respective left- and right-actions by the cocycles $\tau_1$ and $\tau_2$ asserted in
Theorem 7.2 (where some additional subscripts '3' from the statement of that theorem
have also been suppressed). □
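The identification of the quotient $M_\bullet \backslash G_\bullet^2$ with $G_\bullet$ used at the end of this proof can be illustrated on a single fibre. The sketch below is a finite stand-in only: it takes $G = S_3$ (an assumption made so that everything can be enumerated; the fibres in the text are compact groups) and checks that $M \cdot (g_1, g_2) \mapsto g_1^{-1} g_2$ is well defined on cosets of the diagonal subgroup and hits every element of $G$ exactly once. All function names are hypothetical.

```python
from itertools import permutations, product

def compose(p, q):
    """Group product in S_3, with permutations as tuples: (p * q)(i) = p[q[i]]."""
    return tuple(p[i] for i in q)

def inverse(p):
    """Inverse permutation."""
    inv = [0] * len(p)
    for i, j in enumerate(p):
        inv[j] = i
    return tuple(inv)

G = list(permutations(range(3)))  # finite stand-in for one fibre group

def coset(g1, g2):
    """The coset M.(g1, g2) of the diagonal subgroup M = {(g, g) : g in G}."""
    return frozenset((compose(g, g1), compose(g, g2)) for g in G)

def build_quotient_map():
    """Map each coset M.(g1, g2) to g1^{-1} g2, failing if this is ill defined."""
    images = {}
    for g1, g2 in product(G, repeat=2):
        value = compose(inverse(g1), g2)
        # setdefault stores the first value seen for this coset and returns it;
        # any later mismatch would mean the map depends on the representative.
        if images.setdefault(coset(g1, g2), value) != value:
            raise ValueError("map is not well defined on cosets")
    return images

quotient = build_quotient_map()
print(len(quotient), len(set(quotient.values())))
```

Well-definedness is the identity $(g g_1)^{-1}(g g_2) = g_1^{-1} g_2$, which is also why the diagonal action of $T_3$ through $(\sigma, \sigma)$ becomes trivial on the quotient.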

We will prove Proposition 7.5 in several steps. First observe that since the
extensions $\alpha_i|_{\zeta_i} : \mathbf{Z}_i \to \mathbf{W}_i$ are isometric and relatively ergodic for $T_3|_{\zeta_i}$, the
non-ergodic Furstenberg-Zimmer Theory of Section 5 enables us to pick
coordinatizations by core-free homogeneous space data for the $(\mathbb{Z}e_3)$-subactions

(recall that $\mathbf{X}^{\upharpoonright \mathbb{Z}e_3}$ denotes the subaction system given by retaining only the action
through $T$ of the one-dimensional subgroup $\mathbb{Z}e_3 \le \mathbb{Z}^3$), where we may also choose
the cocycle sections $\sigma'_i$ to be ergodic.

Since $\zeta_1$ and $\zeta_2$ (like $\zeta_0^{T_1}$ and $\zeta_0^{T_2}$) are relatively independent over $\zeta_0^{T_1,T_2}$ under $\mu$,
we can combine the above two coordinatizations to give

where $G_\bullet := G'_{1,\pi_1(\bullet)} \times G'_{2,\pi_2(\bullet)}$, $K_\bullet := K'_{1,\pi_1(\bullet)} \times K'_{2,\pi_2(\bullet)}$ and
$\sigma := (\sigma'_1 \circ \pi_1, \sigma'_2 \circ \pi_2)$, and we here write $\pi_i$ for the obvious factor map $\mathbf{W} \to \mathbf{W}_i$.

Of course, we do not know that the restrictions of $T_1$ and $T_2$ to the factors $\alpha_i|_{\zeta_i} :
\mathbf{Z}_i \to \mathbf{W}_i$ are isometric, and so we have no similar coordinatization of these
transformations using homogeneous space data and cocycles. We will appeal instead
to the Relative Automorphism Structure Theorem 6.5 to describe them in terms
of cocycles and fibrewise automorphisms.

First, however, an appeal to Corollary 4.4 gives a first step towards the more explicit
description of the $T_3|_\zeta$-invariant factor in terms of the above coordinatizations:

Proposition 7.6. There are $T_3|_\alpha$-invariant Mackey group data $M'_\bullet \le G_\bullet$ on $W$
and a measurable section $b' : W \to G_\bullet$ such that the factor map

Z ^ Zl'\- X {M:\G./K,) : 

coordinatizes the $T_3|_\zeta$-invariant factor of $\mathbf{Z}$. □

The remainder of our work will go into analyzing this Mackey group data $M'_\bullet$ and
section $b'$ to deduce properties of the data $(G'_{i,\bullet}/K'_{i,\bullet}, \sigma'_i)$ that gave rise to them,
and eventually reduce them to the special form promised by Proposition 7.5.

We now prove two technical lemmas that will underlie our subsequent analysis, and
which it seems easiest to introduce separately.

Lemma 7.7. Suppose that $(X, \mu, T)$ is a $\mathbb{Z}$-system, $G_\bullet$ is $T$-invariant measurable
compact group data on $X$ and $\sigma : X \to G_\bullet$ a cocycle-section, and that $(Y, \nu)$ is
another standard Borel probability space. Suppose further that $\lambda$ is a $(T \times \mathrm{id}_Y)$-invariant
joining of $\mu$ and $\nu$. If $M_\bullet \le G_\bullet$ is the Mackey group data of $G_\bullet$ and $\sigma$
over $(X, \mu, T)$ and $\pi : X \times Y \to X$ is the coordinate projection, then the Mackey
group data $N_\bullet$ of $G_{\pi(\bullet)}$ and $\sigma \circ \pi$ over $(X \times Y, \lambda, T \times \mathrm{id}_Y)$ is given by $M_{\pi(\bullet)}$ (up
to a $T$-invariant measurable choice of conjugates) $\lambda$-almost surely.

Remark It is easy to see that $N_\bullet \le M_{\pi(\bullet)}$; the point of this lemma is that
if we adjoin to $(X, \mu, T)$ a system on a new space $(Y, \nu)$ for which the action is
trivial, then the Mackey group data does not become any smaller. $\lhd$

Proof We know from Section 4 that there is a section $b : X \times Y \to G_{\pi(\bullet)}$ such
that $b(Tx, y)^{-1} \cdot \sigma(x) \cdot b(x, y) \in N_{(x,y)}$ for $\lambda$-almost every $(x, y)$.

Let $A$ be the $\lambda$-conegligible subset of $X \times Y$ where this coboundary condition
obtains, and let

$$B_0 := \{(x, y) \in A : \text{some conjugate of } N_{(x,y)} \text{ is properly contained in } M_x\};$$

this is easily seen to be Borel and $\lambda$-almost $(T \times \mathrm{id}_Y)$-invariant, and so writing
$B := \bigcap_{n \in \mathbb{Z}} (T \times \mathrm{id}_Y)^n(B_0)$ we see that $\lambda(B) = \lambda(B_0)$ and that $B$ is strictly
$(T \times \mathrm{id}_Y)$-invariant. It will suffice to show that $B$ is $\lambda$-negligible, so suppose otherwise. Then
by Proposition 2.4 there are a non-negligible $T$-invariant subset $C \in \Sigma_X$ and a
$T$-invariant measurable selector $\eta : C \to Y$ such that $(x, \eta(x)) \in B$ almost surely.
We deduce that $b(Tx, \eta(x))^{-1} \cdot \sigma(x) \cdot b(x, \eta(x)) \in N_{(x,\eta(x))}$ for every $x \in C$
with $N_{(x,\eta(x))}$ properly contained in some (clearly measurably-varying) conjugate
of the Mackey group data $M_x$, contradicting the conjugate-minimality of this latter
that was proved in part (4) of Theorem 4.1. □

Corollary 7.8. In the notation set up earlier in this section, we have

$\{g_1 : \exists g_2 \in G'_{2,\bullet}\ \text{s.t.}\ (g_1, g_2) \in M'_\bullet\} = G'_{1,\bullet}$

almost surely, and similarly for the projection of $M'_\bullet$ onto $G'_{2,\bullet}$.

Proof We give the argument for $i = 1$. Simply observe that the extension
$\mathbf{W}^{\restriction T_3} \to \mathbf{W}_1^{\restriction T_3}$ is relatively invariant, and so the Mackey group data for our co-
ordinatization of the extension $\alpha|_{\zeta_1 \vee \alpha_2} : (\zeta_1 \vee \alpha_2)(\mathbf{X})^{\restriction T_3} \to \mathbf{W}^{\restriction T_3}$ must simply
be lifted from the Mackey group data for $\alpha_1|_{\zeta_1} : \mathbf{Z}_1^{\restriction T_3} \to \mathbf{W}_1^{\restriction T_3}$ downstairs. Since
the former is clearly equal to the given one-dimensional projection of $M'_\bullet$, and the
cocycle section $\sigma'_1$ is assumed to be ergodic, this completes the proof. □

The next properties of $M'_\bullet$ that we deduce require a little more work. We begin
with a useful group-theoretic lemma.

Lemma 7.9 (Deconstructing a relation between two group correspondences). Sup-
pose that $G_1$, $G_2$ are compact groups and that $M_1, M_2 \le G_1 \times G_2$ are two sub-
groups that both have full one-dimensional projections, and let their one-dimensional
slices be

$L_{1,1} := \{g \in G_1 : (g, 1_{G_2}) \in M_1\}, \quad L_{1,2} := \{g \in G_2 : (1_{G_1}, g) \in M_1\}$

and similarly $L_{2,1}$, $L_{2,2}$. Suppose further that $\Phi_i : G_i \stackrel{\cong}{\longrightarrow} G_i$ and $h_i, k_i \in G_i$ for
$i = 1, 2$ satisfy

$(h_1, h_2) \cdot (\Phi_1 \times \Phi_2)(M_1) \cdot (k_1, k_2) = M_2.$

Then $\Phi_i(L_{1,i}) = L_{2,i}$ for $i = 1, 2$.

Proof Suppose first that $g \in L_{1,1}$, so that $(g, 1_{G_2}) \in M_1$. Then the given equation tells us that

$(h_1 \cdot \Phi_1(g) \cdot k_1,\ h_2 \cdot k_2) = (m_1, m_2)$

for some $(m_1, m_2) \in M_2$, and in this case we have that $m_2 = h_2 \cdot k_2$ does not depend
on $g$. Since the above must certainly hold if $g = 1_{G_1}$, applying it also for any other
$g$ and differencing gives

$(h_1 \cdot \Phi_1(g) \cdot k_1) \cdot (h_1 \cdot k_1)^{-1} = h_1 \cdot \Phi_1(g) \cdot h_1^{-1} \in L_{2,1},$

so $\Phi_1(L_{1,1}) \subseteq h_1^{-1} \cdot L_{2,1} \cdot h_1$. An exactly symmetric argument gives the re-
verse inclusion, so in fact $\Phi_1(L_{1,1})$ is a conjugate of $L_{2,1}$. However, since $M_1$ and
$M_2$ have full one-dimensional projections, by Lemma 2.8 it follows that in fact
$\Phi_1(L_{1,1}) = L_{2,1}$, as required. The case of the other coordinate is similar. □
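
The last step of this proof rests on the fact that the one-dimensional slices of a subgroup with full one-dimensional projections are normal, so that a conjugate of $L_{2,1}$ inside $G_1$ must equal $L_{2,1}$. A short verification of this standard fact is sketched below as a reminder; it is presumably what the cited lemma provides, and the symbols $M$, $L_1$, $h$, $h'$ are local to this sketch.

```latex
% Claim: if M \le G_1 \times G_2 projects onto G_1, then
%   L_1 := \{ g \in G_1 : (g, 1_{G_2}) \in M \}
% is a normal subgroup of G_1.
\begin{align*}
&\text{Fix } g \in L_1 \text{ and } h \in G_1, \text{ and choose } h' \in G_2 \text{ with } (h, h') \in M. \\
&(h, h') \cdot (g, 1_{G_2}) \cdot (h, h')^{-1}
   = (h g h^{-1},\ h' \, 1_{G_2} \, h'^{-1})
   = (h g h^{-1},\ 1_{G_2}) \in M, \\
&\text{so } h g h^{-1} \in L_1, \text{ that is, } h L_1 h^{-1} \subseteq L_1 \text{ for every } h \in G_1 .
\end{align*}
```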

Lemma 7.10. If $H_{i,\bullet} \le G'_{i,\bullet}$ are the one-dimensional slices of $M'_\bullet$, then

(1) $H_{i,(w_1,w_2)}$ $\alpha_\#\mu$-almost surely depends only on $w_i$, so after modifying on a
negligible set we may write it as $H_{i,w_i}$;

(2) under the above coordinatizations, for $i = 1, 2$ the map

$W_i \ltimes (G'_{i,\bullet}/K'_{i,\bullet}) \to W_i \ltimes (G'_{i,\bullet}/(H_{i,\bullet}K'_{i,\bullet})) :$

$(w_i, gK'_{i,w_i}) \mapsto (w_i, gH_{i,w_i}K'_{i,w_i})$

defines a factor for the whole $\mathbb{Z}^3$-action $T$ (that is, it is respected by $T_1$ and
$T_2$ as well as $T_3$).

Proof By symmetry it suffices to treat the case $i = 1$ for the first conclusion
and $i = 2$ for the second (it will turn out that these come together). First deduce
from Corollary 7.8 and Lemma 2.8 that in fact $H_{i,(w_1,w_2)} \trianglelefteq G'_{i,w_i}$ for almost every
$(w_1, w_2)$.

We will use the presence of the additional transformations of the factors $\zeta_1$ and $\zeta_2$
given by $T_1$. Of course, $T_1$ just restricts to the identity transformation on $\zeta_1$. On
the other hand, since $T_1|_{\zeta_2}$ commutes with the transformation $T_3|_{\zeta_2}$ which is rela-
tively ergodic for the extension $\mathbf{Z}_2 \to \mathbf{W}_2$, the Relative Automorphism Structure
Theorem 6.5 allows us to express

for some section $\rho'_1 : W_2 \to G'_{2,\bullet}$ and $T_3|_{\alpha_2}$-invariant section $\Phi'_{1,\bullet} : W_2 \to \mathrm{Isom}(G'_{2,\bullet}, G'_{2,T_1|_{\alpha_2}(\bullet)})$
satisfying $\Phi'_{1,\bullet}(K'_{2,\bullet}) = K'_{2,T_1|_{\alpha_2}(\bullet)}$ almost surely.

Now observe from Corollary 6.9 that using the above expression and its partner for
$T_2|_{\zeta_1}$ we may extend all three transformations $T_j|_{\zeta_i}$, $j = 1, 2, 3$, to the covering
group extension $\tilde{\alpha}_i : \tilde{\mathbf{Z}}_i \to \mathbf{Z}_i \to \mathbf{W}_i$ arising from our core-free homogeneous-
space-data coordinatization of $T_3|_{\zeta_i}$, and that these extensions retain commutativity
and all the relative invariance, ergodicity and isometricity properties listed above.
Form the relatively independent joining

{Co l<:«l°"l=Co ^U2°"2} 

with the coordinate projection factors back onto $\tilde{\mathbf{Z}}_1$ and $\tilde{\mathbf{Z}}_2$; with the resulting fac-
tor map onto $\mathbf{W}$ it now defines a covering group extension of $\alpha|_\zeta : \mathbf{Z} \to \mathbf{W}$ whose
Mackey data are still $M'_\bullet$ and $b'$ (by our initial construction of these). Moreover,
these new factors $\tilde{\mathbf{Z}}_i$ and $\tilde{\mathbf{Z}}$ are located in a commutative diagram with the factors
$\mathbf{W}_i$ and $\mathbf{W}$ just as we saw previously for $\mathbf{Z}_i$ and $\mathbf{Z}$ (except now not all as factors
of the original overall system $\mathbf{X}$, but of some extended overall system).

It follows that for the purpose of proving this lemma, we may work with these
covering group extensions throughout without disrupting the final conclusions; or,
equivalently, that it suffices to treat the case in which the core-free kernels $K'_{i,\bullet}$ are
trivial. Let us therefore make this assumption for the rest of this proof so as to
lighten notation.

Given this assumption, consider the condition that $T_1|_\zeta$ respect $\zeta_0^{T_3}|_\zeta$ in terms of
the above expression for $T_1|_{\zeta_2}$ and the Mackey data. First, since $M'_\bullet$ has full one-
dimensional projections we may take the Mackey section $b$ of Proposition 7.6 to
be of the form $b(w) = (1_{G'_{1,w_1}}, b'(w))$. Now the above condition requires, in
particular, that $\zeta_0^{T_3}|_\zeta(T_1|_\zeta(z))$ almost surely depend only on $\zeta_0^{T_3}|_\zeta(z)$ for $z \in Z$;
and on the other hand, in terms of the above Mackey description, writing points of
$Z$ as $(w, g_1, g_2)$ we know that $\zeta_0^{T_3}|_\zeta(w, g_1, g_2) = \zeta_0^{T_3}|_\zeta(w, g'_1, g'_2)$ if and only if

$M'_w \cdot (1, b'(w)) \cdot (g_1, g_2) = M'_w \cdot (1, b'(w)) \cdot (g'_1, g'_2)$

$\Longleftrightarrow\ (g_1, g_2) \in (1, b'(w)^{-1}) \cdot M'_w \cdot (1, b'(w)) \cdot (g'_1, g'_2).$

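Therefore the above relation between $T_1|_\zeta$ and $\zeta_0^{T_3}|_\zeta$ simply asserts that for $\alpha_\#\mu$-
almost every $(w_1, w_2) \in W$, for Haar-almost every $(g_1, g_2) \in G'_{1,w_1} \times G'_{2,w_2}$ there
is some $(g''_1, g''_2) \in G'_{1,w_1} \times G'_{2,T_1|_{\alpha_2}(w_2)}$ such that

$(\mathrm{id}_{G'_{1,w_1}} \times (L_{\rho'_1(w_2)} \circ \Phi'_{1,w_2}))\big((1, b'(w_1, w_2)^{-1}) \cdot M'_{(w_1,w_2)} \cdot (1, b'(w_1, w_2)) \cdot (g_1, g_2)\big)$
$= (1, b'(w_1, T_1|_{\alpha_2}(w_2))^{-1}) \cdot M'_{(w_1, T_1|_{\alpha_2}(w_2))} \cdot (1, b'(w_1, T_1|_{\alpha_2}(w_2))) \cdot (g''_1, g''_2),$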
or, re-arranging, that 

(^Gi,^l ' ^'(^1' ^1 \a2i'W2))piw2)'^l.,ib'iwi,W2y^)) 

= ^{«>l,TiU2(«,2))- 

The two desired conclusions now follow from applying Lemma 7.9 to this equation
for the two coordinate projections onto $G'_1$ and $G'_2$. Under the first coordinate
projection we obtain

$H_{1,(w_1,w_2)} = H_{1,(w_1,T_1|_{\alpha_2}(w_2))},$

so $H_{1,(w_1,w_2)}$ is a $T_1|_\alpha$-invariant subgroup of $G'_1$ and so recalling that $T_1|_\alpha$ is
relatively ergodic on the extension $\alpha_1|_\alpha : \mathbf{W} \to \mathbf{W}_1$ we deduce that $H_{1,(w_1,w_2)}$ is
virtually a function of $w_1$ alone, as required for conclusion (1).

For the second coordinate projection we obtain

$\Phi'_{1,w_2}(H_{2,(w_1,w_2)}) = H_{2,(w_1,T_1|_{\alpha_2}(w_2))}.$

In view of the conclusion (1) obtained above we can simplify this to

$\Phi'_{1,w_2}(H_{2,w_2}) = H_{2,T_1|_{\alpha_2}(w_2)},$

and now this is precisely the condition given by the Relative Automorphism Struc-
ture Theorem 6.5 for $T_1|_{\zeta_2}$ to respect the given map as a factor map. Since it is
clear that the given map defines a factor for the restrictions of $T_2$ (since this acts
trivially on the whole of $\mathbf{Z}_2$) and $T_3$ (since this acts on this extension by a $G'_{2,\bullet}$-
valued cocycle-section, and so our factor map is simply the fibrewise quotient by
the subgroup $H_{2,\bullet}$, recalling our assumption that $K'_{2,\bullet}$ is almost surely trivial), this
completes the proof. □

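In view of the above result, we are now able to define our desired intermediate
factors $\mathbf{Z}_i \to \mathbf{Y}_i \to \mathbf{W}_i$ by the commutative diagrams

[Commutative diagram: the factor $\mathbf{Y}_1$ of $\mathbf{Z}_1$ is identified canonically with the extension $\mathbf{W}_1 \ltimes (G_{1,\bullet}/K_{1,\bullet}, m_{G_{1,\bullet}/K_{1,\bullet}})$ of $\mathbf{W}_1$.]

and similarly for $\mathbf{Y}_2$, where $G_{1,\bullet} := G'_{1,\bullet}/H_{1,\bullet}$, $K_{1,\bullet} := (H_{1,\bullet}K'_{1,\bullet})/H_{1,\bullet}$ and $\rho_i$, $\Phi_{i,\bullet}$
and $\sigma_i$ are the appropriate quotients or restrictions of $\rho'_i$, $\Phi'_{i,\bullet}$ and $\sigma'_i$: part (1)
above gives that $H_{i,\bullet}$ is correctly defined as a function on $W_i$, and part (2) gives
that the above diagram defines a factor map for our whole $\mathbb{Z}^3$-action.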

The important feature of these new smaller extensions $\mathbf{Y}_i \to \mathbf{W}_i$ is that the
Mackey group data $M_\bullet$ of their joining under $\mathbf{X}$ takes a particularly simple form:
having quotiented out the one-dimensional slices $H_{i,\bullet}$, it is almost surely the graph
of a continuous isomorphism. Indeed, $M_\bullet$ is clearly obtained from $M'_\bullet$ simply
by quotienting out the normal subgroup data $H_{1,\pi_1(\bullet)} \times H_{2,\pi_2(\bullet)}$, and from the
definition of $H_{i,\bullet}$ it follows that $M_\bullet$ has full one-dimensional projections and triv-
ial one-dimensional slices almost everywhere, and so defines almost everywhere
the graphs of some measurably-varying $T_3|_\alpha$-invariant isomorphisms $\Psi_{(w_1,w_2)} : G_{1,w_1} \stackrel{\cong}{\longrightarrow} G_{2,w_2}$.
Henceforth we will refer to these as the Mackey isomorphisms.
In addition we set $b := b' \cdot H_{2,\pi_2(\bullet)}$, so that $(1_{G_{1,\pi_1(\bullet)}}, b)$ is a Mackey section of the
extension $\mathbf{Y}^{\restriction T_3} \to \mathbf{W}^{\restriction T_3}$ associated to the choice of Mackey group data $M_\bullet$.

On the other hand, from the description given in Proposition 7.6 it follows that
the $T_3$-invariant factor is actually contained in the join of these smaller isometric
extensions $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$, and so it will suffice to study these new factors.
The remaining steps of this subsection will give a recoordinatization of these new
factors into the form required by Proposition 7.5.

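Corollary 7.11. We have $\zeta_0^T|_{\alpha_1}(w_1) = \zeta_0^T|_{\alpha_2}(w_2)$ for $\alpha_\#\mu$-almost every $(w_1, w_2)$,
and there are compact group data $G_\bullet$ invariant for the whole action $T$ such that we
can recoordinatize the extensions $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$ so that $G_{1,w_1} = G_{\zeta_0^T|_{\alpha_1}(w_1)} =
G_{2,w_2}$ for $\alpha_\#\mu$-almost every $(w_1, w_2)$.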

Proof The first assertion is clear from the definitions. 

By Tsl^.-invariance the groups Gi^^^ actually depend only on Zi := Co^|c«i(wi) G 
Zf^*'^^, and similarly the isomorphism '^(wi,w2) depends only on the image (zi, Z2) 
of $(w_1, w_2)$. In addition, by Theorem 7.1 the coordinates $z_1$, $z_2$ of this image are
relatively independent over C(f lai(^i) = C(f |o2(^w) under a#^. 

Now let P : Zq ^ -^cf^'^^ be a probability kernel representing the disinte- 
gration of {CQ^''^^)4ifi over Co\^T2,T3. For almost every zi € zT^''^^ we can 

choose a measurable family of isomorphisms 02,^2 ■ ^'2,23 ~^ Gi^zi defined for 
P(Co^Lti,t3 (zi), ■ )-almost every Z2 € zT^''^^, because the Mackey isomorphisms 
themselves witness that these almost surely exist. Making a measurable selection 
of such a zi in each fibre of ^(f^'^^ Zq, we now take this family as defining 
a fibrewise isomorphism recoordinatization of our initial homogeneous-space-data 
coordinatization of a2\(2 ■ ^2 W2 obtained above. This has the effect of ad- 
justing to a coordinatization in which the covering group of the homogeneous space 

fibre over Z2 G Z2 is P(Co"| ti,t3 (zi), ■ )-almost everywhere equal to Gi z^, and 

Co ' 
with the kernel of the homogeneous space fibre given by 62,^2 (-^^2, 22) ^ Gi^zi- 

In particular, the covering group data of this new coordinatization depends only 

on C(f = C(f = C(f |a2 (""^2)- Exactly similarly we can now reco- 

Co 

ordinatize ai\^^ : Yi Wi to have covering fibre groups also depending only 
on Co\ai{wi) = C(f|o2(^2)- Since both these recoordinatizations are by fibre- 
wise isomorphisms that are invariant for the relevant restrictions of T3, the new 
coordinatizations of these extensions that result are still given as cocycle-section 
extensions for these restrictions of T3. Finally, in this new coordinatization the 
measurable family of Mackey isomorphisms "^(^wi,w2) clearly shows that after one 
more fibrewise recoordinatization by a T-invariant isomorphism we are left with 
the same T-invariant group data G, everywhere. □ 

Now let us re-apply the Relative Automorphism Structure Theorem 16. 5 1 to write 
and 

Taigi = T2U, K (L^2(.) o $2,.)5:;2i^^(.) 

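where now $\Phi_{i,\bullet}$ is $T_3|_{\alpha_{3-i}}$-invariant and takes values in $\mathrm{Aut}\, G_{\zeta_0^T|_{\alpha_{3-i}}(\bullet)}$ for $i =
1, 2$. In addition, we recall the notation $\mathrm{Co}_{\rho(\bullet)}$ for the fibrewise automorphism of
some measurable group data $G_\bullet$ given by fibrewise conjugation by a section $\rho$ of
$G_\bullet$.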

Proposition 7.12. The extensions $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$ can be recoordinatized by
fibrewise affine transformations so that

(1) there are cocycles $\tau_i : W_{3-i} \to G_{\zeta_0^T|_{\alpha_{3-i}}(\bullet)}$ such that $L_{\rho_i(\bullet)} \circ \Phi_{i,\bullet} = R_{\tau_i(\bullet)}$;

(2) the Mackey isomorphisms are trivial: $\Psi_\bullet = \mathrm{id}_{G_{\zeta_0^T|_\alpha(\bullet)}}$;

(3) the Mackey section is trivial: $b = 1_{G_\bullet}$;

(4) the cocycle $\tau_i$ is invariant under $T_3|_{\alpha_{3-i}}$ and the cocycle $\sigma_i$ is invariant under $T_{3-i}|_{\alpha_i}$.

Proof This will follow from a careful consideration of the commutativity condi-
tions relating the expressions for our three transformations on $\mathbf{Y}_i$. We make our
recoordinatizations in two steps, the first by fibrewise automorphisms and the sec-
ond by fibrewise rotations. We will construct these so as to guarantee the asserted
properties of the ingredients $\rho$ and $\Phi$, and will then find that the asserted forms of
$\Psi$ and $b$ are an immediate consequence.

First observe that just as in the proof of Lemma 7.10 we may lift all of our commuting
transformations $T_j|_{\xi_i}$ to the covering group-data extensions of $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$,
and have that $M_\bullet$ and $b$ will still be Mackey data of their relatively independent joining
joining over Zg^' ^, and therefore if we effect our desired fibrewise recoordinatiza- 
tions on these covering group-data extensions then simply quotienting will give the 
desired recoordinatizations of $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$. As in Lemma 7.10, this argument
reduces our work to the special case when $K_{i,\bullet} = \{1_{G_\bullet}\}$.

The remainder of our work breaks into five steps. 

Step 1 We consider the case $i = 1$. First recall our earlier expression of the
fact that $T_1|_\xi$ respects $\zeta_0^{T_3}|_\xi$: for $\alpha_\#\mu$-almost every $(w_1, w_2) \in W$, setting $s :=
\zeta_0^T|_{\alpha_1}(w_1)$, we have that for Haar-almost any $g' \in G_s$ there is some $g'' \in G_s$ for
which

(idc, X (Lp^(^2) o$i_^2))((l,6(u;i,7i;2)-^) •M(^j_^2) • {l,b{wi,W2)) ■ (1,5')) 

We can re-write this condition in terms of the Mackey isomorphisms to give 

))(•) ■^i,w2{Kwi,W2) -g) 
= b{wi,Ti\a2{w2)y^ ■ ^(«,i,Ti|„2(«,2))(*) • Kwi,Ti\o,2{w2)) ■ g", 

and now if we write $\tilde{\Psi}_\bullet := \mathrm{Co}_{b(\bullet)^{-1}} \circ \Psi_\bullet$ and $\tilde{\Phi}_{1,\bullet} := \mathrm{Co}_{\rho_1(\bullet)} \circ \Phi_{1,\bullet}$ this in turn
becomes
(so we have simply shifted all the 'translation' parts of our affine transformations
over to the right). Finally, this now clearly requires that

almost everywhere: the automorphisms graphed by the Mackey group data have 
themselves become a coboundary for the automorphism-valued cocycle 

We will refer to the above as the 'automorphism coboundary equation' for the
remainder of this proof. The condition that this hold for almost every $(w_1, w_2)$
also gives nontrivial information on the automorphisms $\tilde{\Psi}_{(w_1,w_2)}$ for different $w_1$,
since $w_1$ is absent from the left-hand side. Since the extension $\alpha_2|_\alpha : \mathbf{W} \to \mathbf{W}_2$ is
relatively invariant for the restrictions of $T_1$ and $T_3$, using Proposition 2.4 we can
therefore choose a $T_1$-invariant measurable selector $\eta : W_2 \to W_1$ so that

$\tilde{\Phi}_{1,w_2} = \tilde{\Psi}_{(\eta(w_2), T_1|_{\alpha_2}(w_2))} \circ \tilde{\Psi}^{-1}_{(\eta(w_2), w_2)}$

holds almost surely and so witnesses that $\tilde{\Phi}_{1,w_2}$ is a coboundary in $\mathrm{Aut}(G_s)$ for
the transformation $T_1|_{\alpha_2}$.

Naively we should now like to use the cocycle $\tilde{\Psi}_{(\eta(\bullet),\bullet)}$ to make a fibrewise auto-
morphism recoordinatization of the extension $\alpha_2|_{\xi_2} : \mathbf{Y}_2 \to \mathbf{W}_2$ so that the first
of our automorphism-valued coboundary equations above gives a simplification of
$\tilde{\Phi}_{1,w_2}$. However, this idea runs into difficulties because the new isomorphisms
$\tilde{\Psi}_\bullet$, unlike $\Psi_\bullet$, are not necessarily $T_3$-invariant, and so applying them fibrewise may
disrupt the coordinatization of $T_3|_{\xi_2}$ as acting by rotations.

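Step 2 The best we can do at this stage is to apply fibrewise the automorphisms

$\Psi_{(\eta(w_2), w_2)}$

to our coordinatization of $\alpha_2|_{\xi_2}$. This gives some improvement: in the
resulting new coordinatization of this extension, our automorphism coboundary
equation above now reads

$\tilde{\Phi}_{1,w_2} = \mathrm{Co}^{-1}_{b(\eta(w_2), T_1|_{\alpha_2}(w_2))} \circ \mathrm{Co}_{b(\eta(w_2), w_2)} = \mathrm{Co}_{b(\eta(w_2), T_1|_{\alpha_2}(w_2))^{-1} \cdot b(\eta(w_2), w_2)}.$

Recalling that $\tilde{\Phi}_{1,\bullet} = \mathrm{Co}_{\rho_1(\bullet)} \circ \Phi_{1,\bullet}$, this unravels to give

$\Phi_{1,w_2} = \mathrm{Co}^{-1}_{\rho_1(w_2)} \circ \mathrm{Co}_{b(\eta(w_2), T_1|_{\alpha_2}(w_2))^{-1} \cdot b(\eta(w_2), w_2)}$

$= \mathrm{Co}_{\rho_1(w_2)^{-1} \cdot b(\eta(w_2), T_1|_{\alpha_2}(w_2))^{-1} \cdot b(\eta(w_2), w_2)},$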

so we conclude, in particular, that the automorphism-valued cocycle $\Phi_{1,\bullet}$ takes
values in the compact subgroup of inner automorphisms, and so we may represent
it as $\mathrm{Co}_{\theta(\bullet)}$ for some $T_3|_{\alpha_2}$-invariant section $\theta : W_2 \to G_{\zeta_0^T|_{\alpha_2}(\bullet)}$. Writing out the
above automorphism coboundary equation in terms of $\theta$ it becomes

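and so if we now substitute into our original expression for $T_1$ we obtain

$T_1|_{\xi_2} = T_1|_{\alpha_2} \ltimes (L_{\rho_1(\bullet)} \circ \mathrm{Co}_{\theta(\bullet)})$

$= T_1|_{\alpha_2} \ltimes (R_{\rho_1(\bullet)} \circ \mathrm{Co}_{b(\eta(\bullet), T_1|_{\alpha_2}(\bullet))^{-1} \cdot b(\eta(\bullet), \bullet)}).$

It follows that if we now make a second fibrewise recoordinatization of $\alpha_2|_{\xi_2} :
\mathbf{Y}_2 \to \mathbf{W}_2$, this time by rotating each fibre copy of $G_s$ from the left by $b(\eta(w_2), w_2)$
(which virtually depends only on $s = \zeta_0^T|_{\alpha_2}(w_2)$), we are left with a resulting co-
ordinatization of $T_1$ in the desired form of an opposite action:

$T_1|_{\xi_2} = T_1|_{\alpha_2} \ltimes R_{\tau_1(\bullet)}$

where

$\tau_1(\bullet) := b(\eta(\bullet), \bullet)^{-1} \cdot b(\eta(\bullet), T_1|_{\alpha_2}(\bullet)) \cdot \rho_1(\bullet).$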

Of course, both of the above recoordinatizations can be repeated analogously for
the extension $\alpha_1|_{\xi_1} : \mathbf{Y}_1 \to \mathbf{W}_1$ to put $T_2|_{\xi_1}$ into a similar right-multiplicative
form.
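
The passage from the conjugation form of $T_1|_{\xi_2}$ to the right-multiplicative form can be checked by hand. The sketch below assumes the conventions $\mathrm{Co}_c(g) = c g c^{-1}$ and $R_\rho(g) = g\rho$ (an assumption, not confirmed by the surrounding text), and uses the shorthand $c(w_2)$, $h$ and $T_1 w_2 := T_1|_{\alpha_2}(w_2)$, which are introduced only for this computation.

```latex
% Shorthand (hypothetical, for this check only):
%   c(w_2) := b(\eta(w_2), T_1 w_2)^{-1} \cdot b(\eta(w_2), w_2).
% Before the rotation, T_1 acts on the fibre over w_2 by
%   g \mapsto R_{\rho_1(w_2)}(\mathrm{Co}_{c(w_2)}(g)) = c(w_2)\, g\, c(w_2)^{-1}\, \rho_1(w_2).
% New fibre coordinate over w_2:  h := b(\eta(w_2), w_2)\, g.
% Since \eta is T_1-invariant, the new coordinate over T_1 w_2 is
% b(\eta(w_2), T_1 w_2) times the old one.
\begin{align*}
h \;\mapsto\;& b(\eta(w_2), T_1 w_2) \cdot c(w_2)\, g\, c(w_2)^{-1}\, \rho_1(w_2) \\
 =\;& b(\eta(w_2), w_2)\, g \cdot b(\eta(w_2), w_2)^{-1}\, b(\eta(w_2), T_1 w_2)\, \rho_1(w_2) \\
 =\;& h \cdot \tau_1(w_2),
 \qquad \tau_1(w_2) = b(\eta(w_2), w_2)^{-1} \cdot b(\eta(w_2), T_1 w_2) \cdot \rho_1(w_2).
\end{align*}
```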

It follows that in the coordinatizations of these extensions that we have now ob-
tained, $T_3|_{\xi_i}$ is still in the form of a cocycle-section extension $T_3|_{\alpha_i} \ltimes \sigma_i$ (with a
modified cocycle-section $\sigma_i$) and $T_1$ and $T_2$ are in the desired right-multiplicative
form, as for part (1) of the proposition.

Step 3 We now 'invert' the above implication to discover what consequences
these improved coordinatizations imply for the data $\Psi_\bullet$ and $b$.

Recall that our first automorphism coboundary equation held for almost all $(w_1, w_2)$,
before we chose the measurable selector $\eta$, and so in our latest coordinatization
this tells us that

C0ri{w2) = ^{wi,Ti\^2oi2(^2))°^lwi,W2y 

(Note that our first fibrewise recoordinatization above by automorphisms rendered
the cocycle $\Psi_\bullet$ inner at $(\alpha_2)_\#\mu$-almost all the points $(\eta(w_2), w_2)$, which depend
on our choice of measurable selector $\eta$, but we have not yet seen that this cocycle is
inner for almost all $(w_1, w_2)$ as a result of this recoordinatization, hence our need
to go back to the above form of this equation for this stage of the argument.)

It follows that the class $\tilde{\Psi}_{(w_1,w_2)} \circ \mathrm{Inn}(G_s) \in \mathrm{Out}(G_s)$ is invariant under the action
of $\mathrm{id}_{W_1} \times T_1|_{\alpha_2} = T_1|_\alpha$, and it follows similarly that it is invariant under $T_2|_\alpha$. On
the other hand, we have $\tilde{\Psi}_{(w_1,w_2)} \circ \mathrm{Inn}(G_s) = \Psi_{(w_1,w_2)} \circ \mathrm{Inn}(G_s)$ and this latter is
clearly invariant under $T_3|_\alpha$, since it arises from the Mackey group data. Therefore
it is actually $T|_\alpha$-invariant, and so since $\mathrm{Inn}(G_s) \le \mathrm{Aut}(G_s)$ is compact, and so
the resulting space of equivalence classes $\mathrm{Out}(G_s)$ is smooth, it is almost surely
equal to ^'^ o Inn(G) for some Borel map 1', : ^ Aut(G,). Therefore one 
last fibrewise automorphism recoordinatization of 02] ^2 ■ ^2 —>■ W2 by j^,-, 
(which still does not disrupt any of the properties guaranteed previously, provided 
we replace pi and ti with ^^t|^ {m){Pi) and (•)(''"i)) now gives Mackey 

group data of the form 

$M_\bullet = \{(g,\ g_0(\bullet)\, g\, g_0(\bullet)^{-1}) : g \in G_{\zeta_0^T|_\alpha(\bullet)}\}$

for some $T_3|_\alpha$-invariant section $g_0 : W \to G_{\zeta_0^T|_\alpha(\bullet)}$, and now we can simply adjust
the Mackey section $b$ so that $g_0 = 1$, and so the Mackey group data can be taken to
be the diagonal subgroup almost everywhere.

Step 4 Having removed all the nontrivial outer automorphisms and adjusted
the joining Mackey group data, our automorphism coboundary equation has now
simplified down to

$\tilde{\Phi}_{1,w_2} = \mathrm{Co}_{\tau_1(w_2)} \circ \mathrm{Co}^{-1}_{\tau_1(w_2)} = \mathrm{id} = \mathrm{Co}_{b(w_1, T_1|_{\alpha_2}(w_2))^{-1} \cdot b(w_1, w_2)},$

and hence we deduce that $b(\bullet) \cdot C(G_{\zeta_0^T|_\alpha(\bullet)})$, where $C(G_{\zeta_0^T|_\alpha(\bullet)})$ is the centre of
$G_{\zeta_0^T|_\alpha(\bullet)}$, is $T_1|_\alpha$-invariant, and similarly that it is $T_2|_\alpha$-invariant. Making another
measurable selection and recoordinatizing each fibre of $\alpha_2|_{\xi_2} : \mathbf{Y}_2 \to \mathbf{W}_2$ by a
left-rotation by $b(\eta(\bullet), \bullet)$ therefore preserves the structure of $T_1|_{\xi_2}$ as an opposite
rotation (since the resulting additional cocycle $b(w_1, T_1|_{\alpha_2}(w_2))^{-1} \cdot b(w_1, w_2)$ act-
ing on the left takes values in $C(G_s)$, and so may in fact be taken to act on either
side); and after making this recoordinatization we find that the Mackey section has
also trivialized.

Step 5 Finally, let us look back at the relation between $\sigma_1$ and $\sigma_2$ that is im-
plied by the cocycle equation satisfied by the Mackey data given by part (3) of
Theorem 4.1 in light of this newly-simplified Mackey group and section: this now
becomes simply that

$\sigma_2(w_2) = \sigma_1(w_1)$

$\alpha_\#\mu$-almost surely, and hence in this coordinatization it follows that each $\sigma_i$ vir-
tually depends only on $\zeta_0^{T_1,T_2}|_{\alpha_i}$, or, equivalently, is $T_{3-i}|_{\alpha_i}$-invariant. Given this,
the condition that $T_1|_{\xi_2}$ and $T_3|_{\xi_2}$ commute simply reads that for almost every
$w_2 \in W_2$ we have

$\sigma_2(w_2) \cdot g \cdot \tau_1(T_3|_{\alpha_2}(w_2)) = \sigma_2(T_1|_{\alpha_2}(w_2)) \cdot g \cdot \tau_1(w_2) = \sigma_2(w_2) \cdot g \cdot \tau_1(w_2) \quad \forall g \in G_s,$

and so we must also have that $\tau_1$ is $T_3|_{\alpha_2}$-invariant, and similarly that $\tau_2$ is $T_3|_{\alpha_1}$-
invariant. This completes the proof. □
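
The commutativity condition used in Step 5 can be unpacked as follows. This is only a sketch, under the assumption that in the final coordinatization of $\mathbf{Y}_2$ the transformation $T_3$ acts on fibres by left-multiplication by $\sigma_2$ and $T_1$ by right-multiplication by $\tau_1$; the abbreviations $T_1 w$ and $T_3 w$ for $T_1|_{\alpha_2}(w)$ and $T_3|_{\alpha_2}(w)$ are local to this sketch.

```latex
\begin{align*}
T_1 T_3 (w, g) &= T_1\big(T_3 w,\ \sigma_2(w)\, g\big)
                = \big(T_1 T_3 w,\ \sigma_2(w)\, g\, \tau_1(T_3 w)\big), \\
T_3 T_1 (w, g) &= T_3\big(T_1 w,\ g\, \tau_1(w)\big)
                = \big(T_3 T_1 w,\ \sigma_2(T_1 w)\, g\, \tau_1(w)\big).
\end{align*}
% If \sigma_2(T_1 w) = \sigma_2(w), equating the fibre coordinates and
% cancelling \sigma_2(w)\, g on the left gives
\[
\tau_1(T_3 w) = \tau_1(w) \quad \text{for almost every } w \in W_2 .
\]
```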

The recoordinatization of the preceding proposition leaves only one detail remain-
ing for the proof of Proposition 7.5.

Corollary 7.13. In our homogeneous-space data coordinatizations of $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to
\mathbf{W}_i$ the core-free kernels $K_{i,\bullet}$ are almost surely trivial.



Proof As remarked at the beginning of the preceding proof, we can lift to the
covering group extensions and make the adjustments of Proposition 7.12 there,
and they will then quotient back down to well-defined recoordinatizations of the
original extensions, because at each stage we have only applied either fibrewise
automorphisms or fibrewise left-rotations. From these we have obtained expressions

^ll6 = ^i|q2 ^ 

and similarly for $T_2|_{\xi_1}$ at the level of the covering group extensions, and so for
these to have well-defined quotient it is necessary that for almost every $w_2$ all
two-sided cosets $g \cdot K_{2,w_2} \cdot \tau_1(w_2)$ for $g \in G_{\zeta_0^T|_{\alpha_2}(w_2)}$ actually be left-cosets
of $K_{2,w_2}$. This, in turn, requires that $\tau_1$ almost surely take values in the nor-
malizer $N_{G_{\zeta_0^T|_{\alpha_2}(w_2)}}(K_{2,w_2})$, which is a closed measurably-varying subgroup of
$G_{\zeta_0^T|_{\alpha_2}(w_2)}$.

Now we recall that $T_1$ restricts to a relatively ergodic action on the extension $\alpha_2|_{\xi_2} :
\mathbf{Y}_2 \to \mathbf{W}_2$ (a condition we have not exploited so far), and so we must have

almost surely, for otherwise the homogeneous space fibres of the extension $\alpha_2|_{\xi_2} :
\mathbf{Y}_2 \to \mathbf{W}_2$ would decompose into cosets of the closed subgroups
$N_{G_{\zeta_0^T|_{\alpha_2}(w_2)}}(K_{2,w_2}) \cdot K_{2,w_2}$
to give additional nontrivial invariant sets under the restriction $T_1|_{\xi_2}$. How-
ever, since

this requires in fact that

almost surely, and since $K_{2,w_2}$ is core-free this is possible only if $K_{2,w_2} = \{1\}$
almost surely. An exactly similar argument treats $K_{1,\bullet}$. □

As remarked previously, this completes the proof of Proposition 7.5. □
7.4 Application to characteristic factors 

We will finish this section by offering a second application of our machinery (al- 
though in truth it is largely a corollary of the above). 

Since Furstenberg's ergodic theoretic proof of Szemerédi's Theorem in [23] and
his extension with Katznelson of this result to the multi-dimensional setting in [25],
considerable interest has been attracted by the 'non-conventional' ergodic averages

$\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{d} f_i \circ T_i^n$

associated to a commuting $d$-tuple of probability-preserving transformations $T_1$,
$T_2, \ldots, T_d \curvearrowright (X, \mu)$, that emerge naturally in the course of those proofs.
That these averages converge in $L^2(\mu)$ as $N \to \infty$ in the case $d = 2$ was first
shown by Conze and Lesigne in [13], and their result has since been extended in
many directions [14, 15, 46, 31, 32, 47], culminating in the first proof of the fully
general case by Tao in [43]. We refer the reader to those papers and to [4] for more
thorough motivation and historical discussion of this problem.

Conze and Lesigne's proof of convergence is comparatively soft, using only quite
weak structural information about the above averages to show that they converge
(in particular, using only the structure of certain finite-rank modules rather than
their concrete coordinatizations). More recently, other convergence results for non-
conventional ergodic averages have been based on a similar but more detailed anal-
ysis, resting on the notion of a 'characteristic tuple of factors'. A tuple of factors
$\xi_i : \mathbf{X} \to \mathbf{Y}_i$ is characteristic if

$\frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{d} f_i \circ T_i^n \ \sim\ \frac{1}{N} \sum_{n=1}^{N} \prod_{i=1}^{d} \mathsf{E}_\mu(f_i \,|\, \xi_i) \circ T_i^n$

in $L^2(\mu)$ as $N \to \infty$ for any $f_1, f_2, \ldots, f_d \in L^\infty(\mu)$. Starting with the Conze-Lesigne
proof, most convergence proofs in this area require at some stage the iden-
tification of a characteristic tuple of factors (or a suitable finitary analog of them in
the case of Tao's proof) on which the restricted actions of each $T_i$ take simplified
forms, so that the right-hand averages above can be analyzed to prove convergence
more easily.

A precise description of these characteristic factors in the special case when $T_i =
T^i$ for some fixed ergodic transformation $T$ has now been given in terms of the spe-
cial class of 'pronilsystems' in work of Host and Kra [32] (see also the subsequent
approach of Ziegler [48]). Frantzikinakis and Kra have extended this description to
more general commuting tuples subject to some additional ergodicity assumption
in [20], but a description for arbitrary tuples of commuting transformations, with-
out those ergodicity assumptions, seems to be more difficult. Indeed, it may be that
no comparably clean and useful description is available in the general case. How-
ever, at least when $d = 2$ a reasonably simple coordinatization of a characteristic
pair of factors seems to have been folklore knowledge in ergodic theory for some
time, and in this subsection we will show how our theory enables a careful proof
of it.

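Theorem 7.14 (Characteristic factors for double nonconventional averages). Given
a $\mathbb{Z}^2$-system $\mathbf{X} = (X, \mu, T_1, T_2)$, let $\mathbf{W}_i$ be the target system of the joined factor
$\zeta_0^{T_i} \vee \zeta_0^{T_1 = T_2}$. Then $\mathbf{X}$ admits a characteristic pair of factors $\xi_i : \mathbf{X} \to \mathbf{Y}_i$ that
extend the factors $\zeta_0^{T_i} \vee \zeta_0^{T_1 = T_2}$ and can be described as follows: there are $T$-
invariant compact group data $G_\bullet$, a $T_1|_{\zeta_0^{T_1 = T_2}}$-ergodic cocycle $\sigma : Z_0^{T_1 = T_2} \to G_\bullet$
and a pair of $T_i|_{\zeta_0^{T_{3-i}}}$-ergodic cocycles $\tau_i : Z_0^{T_{3-i}} \to G_\bullet$ such that we can
coordinatize

[Commutative diagram: $(Y_1, (\xi_1)_\#\mu) \cong (W_1, (\alpha_1)_\#\mu) \ltimes (G_\bullet, m_{G_\bullet})$ over $(W_1, (\alpha_1)_\#\mu)$]

with

$T_1|_{\xi_1} : (w, g) \mapsto \big(T_1|_{\alpha_1}(w),\ \sigma(\zeta_0^{T_1 = T_2}|_{\alpha_1}(w)) \cdot g\big)$ and $T_2|_{\xi_1} : (w, g) \mapsto \big(T_2|_{\alpha_1}(w),\ \sigma(\zeta_0^{T_1 = T_2}|_{\alpha_1}(w)) \cdot g \cdot \tau_2(\zeta_0^{T_1}|_{\alpha_1}(w))\big),$

and similarly

[Commutative diagram: $(Y_2, (\xi_2)_\#\mu) \cong (W_2, (\alpha_2)_\#\mu) \ltimes (G_\bullet, m_{G_\bullet})$ over $(W_2, (\alpha_2)_\#\mu)$]

with

$T_1|_{\xi_2} : (w, g) \mapsto \big(T_1|_{\alpha_2}(w),\ \sigma(\zeta_0^{T_1 = T_2}|_{\alpha_2}(w)) \cdot g \cdot \tau_1(\zeta_0^{T_2}|_{\alpha_2}(w))\big)$ and $T_2|_{\xi_2} : (w, g) \mapsto \big(T_2|_{\alpha_2}(w),\ \sigma(\zeta_0^{T_1 = T_2}|_{\alpha_2}(w)) \cdot g\big).$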

Remarks 1. The form of the coordinatizations given above with one action ex-
tended by a cocycle and the other by an opposite cocycle, similarly to Theorem 7.2,
is a special feature of the case of two commuting transformations. It would be pos-
sible to replace it with a coordinatization by homogeneous space data (but
not group data) in which both extensions are by cocycles acting on fibres on the
left, for example by enlarging $G_\bullet$ to $G_\bullet \times G_\bullet$, quotienting by the diagonal subgroup
$\{(g, g) : g \in G_\bullet\}$ and having $\rho$ rotate $G_\bullet \times G_\bullet$ only in the first coordinate and $\sigma_1$,
$\sigma_2$ only in the second. It is presumably this more canonical but more fiddly repre-
sentation, if any, that would admit generalization to larger numbers of commuting
transformations.

2. The above result describes the possible structures of the two characteristic
factors individually, but some opacity remains as to how they can be joined inside
$\mathbf{X}$. While we suspect that the methods of the present section can be brought to bear
on this question also, we will not pursue this analysis in detail here. ◁
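
To illustrate the alternative mentioned in Remark 1, the following sketch records why a left rotation in the second coordinate of $(G \times G)/\Delta$ corresponds to a right rotation on $G$. Here $G$ stands for a single fibre group, $\Delta$ for its diagonal subgroup, and the identification $\phi$ is a choice made for this illustration only; none of this notation is taken from the text.

```latex
% \Delta := \{(d, d) : d \in G\}, \qquad
% \phi : (G \times G)/\Delta \to G, \quad \phi\big((g_1, g_2)\Delta\big) := g_1 g_2^{-1}.
\begin{align*}
\phi\big((g_1 d,\ g_2 d)\Delta\big) &= g_1 d\, d^{-1} g_2^{-1} = g_1 g_2^{-1}
   && \text{($\phi$ is well defined),} \\
\phi\big((a g_1,\ g_2)\Delta\big) &= a \cdot \phi\big((g_1, g_2)\Delta\big)
   && \text{(first coordinate: left rotation by $a$),} \\
\phi\big((g_1,\ c g_2)\Delta\big) &= \phi\big((g_1, g_2)\Delta\big) \cdot c^{-1}
   && \text{(second coordinate: right rotation by $c^{-1}$).}
\end{align*}
```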

The first steps of our analysis, which are essentially contained in Conze and Lesigne
(as well as many subsequent papers; see, for example, Furstenberg and Weiss [28]
for a nice treatment of this stage of the proof), give control over the asymptotic
behaviour of our averages in terms of a certain two-fold self-joining of $\mathbf{X}$. We
will then complete the proof essentially by re-applying Proposition 7.5 to certain
factors of that self-joining.

We observe from the mean ergodic theorem that

$\frac{1}{N} \sum_{n=1}^{N} \int_X (f_1 \circ T_1^n)(f_2 \circ T_2^n)\, d\mu \ \to\ \int_{X^2} f_1 \otimes f_2\, d\mu^F \quad \text{as } N \to \infty,$

where $\mu^F$ is the Furstenberg self-joining, which in this case equals $\mu \otimes_{\zeta_0^{T_1 = T_2}} \mu$
(it has a much more complicated structure for larger numbers of commuting trans-
formations which is not yet well understood; see [4, 3] for further discussion of
this matter). It is easy to check that $\mu^F$ is invariant under the lifted transformations
$T_1^{\times 2}$ and $T_2^{\times 2}$, and also under the diagonal transformation $\vec{T} := T_1 \times T_2$.

This self-joining now helps control our averages through the following conse-
quence of the van der Corput estimate (for which see, for example, Bergelson [9]).

Lemma 7.15. If the pair of factors $\xi_i : \mathbf{X} \to \mathbf{Y}_i$ is such that $\xi_i \succsim \zeta_0^{T_1 = T_2}$ and
$\zeta_0^{\vec{T}} \precsim (\xi_1 \circ \pi_1) \vee (\xi_2 \circ \pi_2)$ then this pair is characteristic.



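Proof This follows from a routine application of the van der Corput estimate.
Suppose that $f_1, f_2 \in L^\infty(\mu)$; clearly by symmetry and iterating our argument, it
suffices to prove that

$\frac{1}{N} \sum_{n=1}^{N} (f_1 \circ T_1^n)(f_2 \circ T_2^n) \ \sim\ \frac{1}{N} \sum_{n=1}^{N} (\mathsf{E}_\mu(f_1 \,|\, \xi_1) \circ T_1^n)(f_2 \circ T_2^n)$

in $L^2(\mu)$ as $N \to \infty$, and hence (taking the difference of the two sides above) that

$\frac{1}{N} \sum_{n=1}^{N} (f_1 \circ T_1^n)(f_2 \circ T_2^n) \to 0$

in $L^2(\mu)$ if $\mathsf{E}_\mu(f_1 \,|\, \xi_1) = 0$.

Letting $F_n := (f_1 \circ T_1^n)(f_2 \circ T_2^n)$, by the van der Corput estimate this will follow
if we show that

$\frac{1}{M} \sum_{h=1}^{M} \frac{1}{N} \sum_{n=1}^{N} \langle F_n, F_{n+h} \rangle \to 0$

as $N \to \infty$ and then $M \to \infty$. Now we simply compute

$\frac{1}{N} \sum_{n=1}^{N} \langle F_n, F_{n+h} \rangle = \frac{1}{N} \sum_{n=1}^{N} \int_X (f_1 \circ T_1^n)(f_2 \circ T_2^n)(f_1 \circ T_1^{n+h})(f_2 \circ T_2^{n+h})\, d\mu$

$= \frac{1}{N} \sum_{n=1}^{N} \int_X \big((f_1 \cdot (f_1 \circ T_1^h)) \circ T_1^n\big) \cdot \big((f_2 \cdot (f_2 \circ T_2^h)) \circ T_2^n\big)\, d\mu$

$\to \int_{X^2} (f_1 \otimes f_2) \cdot ((f_1 \circ T_1^h) \otimes (f_2 \circ T_2^h))\, d\mu^F,$

so that if we now average also in $h$ this converges by the mean ergodic theorem to

$\int_{X^2} (f_1 \circ \pi_1) \cdot (f_2 \circ \pi_2) \cdot g\, d\mu^F$

for some $\vec{T}$-invariant function $g$. Finally, this last integral is zero if
$\mathsf{E}_{\mu^F}(f_1 \circ \pi_1 \,|\, \pi_2 \vee \zeta_0^{\vec{T}}) = 0$, and this follows from our assumptions and the relative inde-
pendence of $\pi_1$ and $\pi_2$ over $\zeta_0^{T_1 = T_2} \circ \pi_1$ under $\mu^F$. □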
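
For reference, one common Hilbert-space formulation of the van der Corput estimate is recorded below. The precise variant used in [9] may differ in inessential details (for instance in the order of the limits or the placement of absolute values), so this should be read as a sketch of the kind of statement being invoked rather than as the exact form used in the proof; in the proof above it would be applied with $u_n = F_n$ in $L^2(\mu)$.

```latex
% (u_n)_{n \ge 1} is a bounded sequence in a Hilbert space.
\[
\lim_{H \to \infty} \frac{1}{H} \sum_{h=1}^{H}
  \limsup_{N \to \infty}
  \left| \frac{1}{N} \sum_{n=1}^{N} \langle u_{n+h}, u_n \rangle \right| = 0
\quad \Longrightarrow \quad
\left\| \frac{1}{N} \sum_{n=1}^{N} u_n \right\| \to 0
\ \text{ as } N \to \infty .
\]
```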

Since on the other hand Theorem 15.121 tells us that C,q ;^ {(^'^^ ti^tj o tti) V 

1 / Cn 



{C^"^ Ti=T2 o vr2), we can deduce the following at once. 

1/Cn 



i/Co 
Corollary 7.16 (Reduction to isometric extensions of isotropy factors). There is a
characteristic pair of factors satisfying ^ C ' ti=t2 i = 1, 2. □ 

We can now present the factors introduced above as another instance of the sit-
uation described before Proposition 7.5 and have that proposition do the heavy
lifting we need again here.

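To see this, we define a system of three commuting transformations on the Fursten-
berg self-joining. Let $\mathbf{X}^F$ be the $\mathbb{Z}^3$-system $(X^2, \mu^F, S_1, S_2, S_3)$ obtained by set-
ting $S_1 := T_1^{\times 2} \vec{T}^{-1} = \mathrm{id}_X \times (T_1 T_2^{-1})$, $S_2 := T_2^{\times 2} \vec{T}^{-1} = (T_2 T_1^{-1}) \times \mathrm{id}_X$ and
$S_3 := \vec{T}$. We observe directly from the definition of $\mu^F$ that for $i = 1, 2$ the
coordinate projection $\pi_i : \mathbf{X}^F \to \mathbf{X}$ is equivalent to $\zeta_0^{S_i}$.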
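
The two identities in the definitions of $S_1$ and $S_2$ follow by multiplying coordinatewise. A short check, assuming only that $T_i^{\times 2} = T_i \times T_i$ and $\vec{T} = T_1 \times T_2$ act on $X^2$ and that $T_1$ and $T_2$ commute:

```latex
\begin{align*}
T_1^{\times 2}\, \vec{T}^{-1} &= (T_1 \times T_1)(T_1^{-1} \times T_2^{-1})
   = \mathrm{id}_X \times (T_1 T_2^{-1}), \\
T_2^{\times 2}\, \vec{T}^{-1} &= (T_2 \times T_2)(T_1^{-1} \times T_2^{-1})
   = (T_2 T_1^{-1}) \times \mathrm{id}_X, \\
S_1 S_3 &= T_1^{\times 2}, \qquad S_2 S_3 = T_2^{\times 2}.
\end{align*}
% In particular S_1 fixes the first coordinate and S_2 fixes the second,
% which is why \pi_i is S_i-invariant.
```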

Now let Wj be (Cj'"^^ V Co'')(^) for i = 1, 2 and let Zj be the target of the max- 
imal subextension of Co^~'^^ V Cq^ : X ^ Wj that is isometric for the restriction 
of Tj. Let 

:= (Co'^^='^' V C^^) o vr, = Co"^'^^ V Co"'' : ^ 

and 

and let $\zeta : \mathbf{X}^F \to \mathbf{Z}$ and $\alpha : \mathbf{X}^F \to \mathbf{W}$ be the joinings $\zeta_1 \vee \zeta_2$ and $\alpha_1 \vee \alpha_2$
respectively.

It is now routine to check from the basic results above that these data satisfy the
same conditions as were needed for Proposition 7.5 with $\mathbf{X}^F$ in place of $\mathbf{X}$ and $S_i$
in place of $T_i$: these factors are once again arranged as in the commutative diagram

where we observe easily from the structure of $\mu^F = \mu \otimes_{\zeta_0^{T_1 = T_2}} \mu$ that
$\zeta_0^{S_1,S_2} \simeq \zeta_0^{T_1 = T_2} \circ \pi_1 \simeq \zeta_0^{T_1 = T_2} \circ \pi_2$. In addition, the transformations $S_i$ enjoy the following
properties:



• Si restricts to the identity on Zj and the factors beneath it, while acting rela- 
tively ergodically on the extension ^' ^|(^3_j : Za^j Zq^' ^,for« = 1,2; 

• the extensions C^^"^^|o. : Wj Zq^''^^ are relatively invariant for the re- 
striction of 53, and the extensions ai|^. : Zj Wj are relatively ergodic 
and isometric for the restriction of 5*3. 

We can therefore apply Proposition 7.5 to these systems and maps to deduce the
following.

Proposition 7.17. There are intermediate factors $\mathbf{Z}_i \to \mathbf{Y}_i \to \mathbf{W}_i$ factorizing
$\alpha_i|_{\zeta_i}$ such that there are $S$-invariant compact group data $G_\bullet$ and cocycle-sections

$\sigma : Z_0^{S_1,S_2} \to G_\bullet$ that is $S_3|_{\zeta_0^{S_1,S_2}}$-relatively ergodic,

$\tau_1 : Z_0^{S_2,S_3} \to G_\bullet$ that is $S_1|_{\zeta_0^{S_2,S_3}}$-relatively ergodic and

$\tau_2 : Z_0^{S_1,S_3} \to G_\bullet$ that is $S_2|_{\zeta_0^{S_1,S_3}}$-relatively ergodic

so that we can coordinatize the actions of the transformations $S$ as




and 




and such that the $(S_3 = \vec{T})$-invariant factor of $\mathbf{X}^{\mathrm{F}}$ is contained in $\xi_1 \vee \xi_2$. □

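Proof of Theorem 7.14. Again this follows simply by unpacking the notation of
the above result: the towers of factors $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$ of $\mathbf{X}^{\mathrm{F}}$ are all actually
contained within the $i$-th coordinate projection $\pi_i : \mathbf{X}^{\mathrm{F}} \to \mathbf{X}$, and by definition
we have $T_j = (S_jS_3)|_{\pi_i}$ for $j = 1,2$. Therefore the above coordinatization of
the action of $\vec{S}$ restricted to the tower of factors $\alpha_i|_{\xi_i} : \mathbf{Y}_i \to \mathbf{W}_i$ converts into a
coordinatized description of $T_1|_{\xi_1}$ as

$$(w,g) \mapsto \big(T_1|_{\alpha_1}(w),\ \sigma(\zeta_0^{T_1=T_2}|_{\alpha_1}(w)) \cdot g\big)$$

and of $T_2|_{\xi_1}$ as

$$(w,g) \mapsto \big(T_2|_{\alpha_1}(w),\ \sigma(\zeta_0^{T_1=T_2}|_{\alpha_1}(w)) \cdot g \cdot \tau_2(\zeta_0^{T_1}|_{\alpha_1}(w))\big)$$

(since $\zeta_0^{S_1,S_3} \simeq \zeta_0^{T_1} \circ \pi_1$), and similarly for $T_1|_{\xi_2}$ and $T_2|_{\xi_2}$. This completes the
proof. □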

Remarks 1. In fact, it is relatively easy to see, by checking functions $f_1$, $f_2$
that are constructed from measurable selections of representative functions on the
compact fibre groups $G_\bullet$, that the characteristic pair of factors that we have now
isolated is minimal, in that any other characteristic pair $\xi_1'$, $\xi_2'$ satisfies $\xi_i' \succsim \xi_i$ for $i = 1,2$.

2. The results of the preceding subsection also give a precise picture of the
$\vec{T}$-invariant factor of $\mathbf{X}^{\mathrm{F}}$ in terms of a diagonal Mackey group and trivial Mackey
section for the joining of the above coordinatizations of $\mathbf{Y}_1$ and $\mathbf{Y}_2$ inside $\mathbf{X}^{\mathrm{F}}$; we
omit these details here. ⊲
8 Further questions



This paper leaves open the obvious question of how to generalize the analysis of 
Section 7 to describe in similar detail

• the possible joint distributions among a larger collection of isotropy factors 

$\zeta_0^{\Gamma_i} : \mathbf{X} \to \mathbf{Z}_0^{\Gamma_i}$ for subgroups $\Gamma_1, \Gamma_2, \ldots, \Gamma_k \leq \mathbb{Z}^d$;

• the possible structures of characteristic factors (and, relatedly, Furstenberg 
self -joinings) for larger commuting tuples of transformations (or commuting 
actions of some other fixed group). 

On the one hand, it seems likely that the machinery of extensions by measurably- 
varying compact homogeneous spaces will be quite essential to any further devel- 
opments in this area. On the other, I suspect that even the next cases to consider in 
the natural hierarchy (joint distributions of four isotropy factors, or characteristic 
factors for three commuting transformations) become much more complicated, and 
it may be in general too much to ask for the kind of precision that we obtained in 
Theorems 1.1 and 1.2.

There is an alternative viewpoint on questions such as these that may be more 
tractable. Instead of asking about exact joint distributions or characteristic factors
for an initially-given system, if we allow ourselves the freedom to pass to any
extension of that system, matters sometimes improve considerably. Indeed, the new
proofs of convergence for linear nonconventional averages in [4] and of Furstenberg
and Katznelson's associated multidimensional multiple recurrence theorem
in [3] both relied on procedures for passing from an initially-given system to some
extension in which the relevant characteristic factors and their joint distributions 
could be described much more simply. 

The constructions of those papers were abstract enough to work without any of the 
machinery of homogeneous-space-data extensions. However, for further applica- 
tions of this idea, in particular to the problem of convergence of related 'polynomial 
nonconventional averages' such as 

$$\frac{1}{N}\sum_{n=1}^{N} (f_1 \circ T_1^{n^2})(f_2 \circ T_1^{n^2}T_2^{n})$$

(discussed, for example, by Bergelson and Leibman in [10]), it seems likely that
these more delicate tools will be necessary. In the forthcoming works [5, 6, 7]
we will make such an analysis allowing ourselves to pass to extensions, focusing
on what improved characteristic factors can be found while retaining some given
algebraic relations among the transformations involved, and will then use this to
prove convergence of the above polynomial averages in $L^2(\mu)$ as $N \to \infty$.

In addition to these quite specialized applications, let us also mention that there 
seem to be further issues on the general behaviour of extensions by homogeneous 
space data to be explored. For example, in [8] it is shown that some of the machinery
of Furstenberg and Zimmer concerning finite-rank modules of an extension
can be extended to the setting in which the extension is relatively finite measure- 
preserving, but the base, while ergodic, is only assumed to be non-singular (that 
is, its measure is only quasi-invariant). In that paper this machinery is needed for 
the proof of a result about the lifting of the 'multiplier property' through certain 
kinds of extension, and this will not require that these general results on finite-rank 
modules be pushed very far. However, it might be interesting to examine whether 
that development can be easily recovered without the assumption of ergodicity of 
the base, using a version of the formalism of the present paper. 